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METHODS OF DIAGNOSIS OF ANGIOGENESIS, COMPOSITIONS 
AND METHODS OF SCREENING FOR ANGIOGENESIS 
5 MODULATORS 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 09/784,356, filed Febraary 14 2001; 
10 USSN 09/791,390, filed February 22, 2001; USSN 60/285,475, filed April 19, 2001, USSN 
60/310,025, filed August 3, 2001, and USSN 60/334,244, filed November 29, 2001, each of 
which is herein incorporated by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
angiogenesis; and to the use of such expression profiles and compositions in diagnosis and 
therapy of angiogenesis. The invention further relates to methods for identifying and using 
agents and/or targets that modulate angiogenesis. 

BACKGROUND OF THE INVENTION 
Both vasculogenesis, the development of an interactive vascular system 
comprising arteries and veins, and angiogenesis, the generation of new blood vessels, play a 
role in embryonic development. In contrast, angiogenesis is limited in a normal adult to the 
placenta, ovary, endometrium and sites of wound healing. However, angiogenesis, or its 
absence, plays an important role in the maintenance of a variety of pathological states. Some 
of these states are characterized by neovascularization, e.g.^ cancer, diabetic retinopathy, 
glaucoma, and age related macular degeneration. Others, e.g., stroke, infertility, heart 
disease, ulcers, and scleroderma, are diseases of angiogenic insufficiency. 

Angiogenesis has a number of stages (see, e.g., Folkman, J.Natl Cancer Inst 
82:4-6, 1990; Fu:estein,yC/m//2ve^^. 103:3-4, 1999;Koch, Arthritis Rheum A1:95U62, 1998; 
Carter, Oncologist 5(Suppl l):51-4, 2000; Browder a/., Cancer Res. 60:1878-86, 2000; and 
Zhu and Witte, Invest New Drugs 17:195-212, 1999). The early stages of angiogenesis 
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include endothelial cell protease production, migration of cells, and proliferation. The early 
stages also appear to require some growth factors, with VEGF, TGF-a, angiostatin, and 
selected chemokines all putatively playing a role. Later stages of angiogenesis include 
population of the vessels with mural cells (pericytes or smooth muscle cells), basement 

5 membrane production, and the induction of vessel bed specializations. The final stages of 
vessel formation include what is known as ^'remodeling", wherein a fprming vasculature 
becomes a stable, mature vessel bed. Thus, the process is highly dynamic, often requiring 
coordinated spatial and temporal waves of gene expression. 

Conversely, the complex process may be subject to disruption by interfering 

10 with one or more critical steps. Thus, the lack of understanding of the dynamics of 

angiogenesis prevents therapeutic intervention in serious diseases such as those indicated. It 
is an object of the invention to provide methods that can be used to screen compounds for the 
ability to modulate angiogenesis. Additionally, it is an object to provide molecular targets for 
therapeutic intervention in disease states which either have an undesirable excess or a deficit 

15 in angiogenesis. The present invention provides solutions to both. 

SUMMARY OF THE INVENTION 
The present invention provides compositions and methods for detecting or 
modulating angiogenesis associated sequences. 
20 In one aspect, the invention provides a method of detecting an angiogenesis- 

associated transcript in a cell in a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridized to a sequence at 
least 80% identical to a sequence as shown in Tables 1-8. In one embodiment, the biological 
sample is a tissue sample. In another embodiment, the biological sample comprises isolated 
25 nucleic acids, which are often mRNA. 

In another embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 
Often, the polynucleotide comprises a sequence as shown in Tables 1-8. The polynucleotide 
can be labeled, for example, with a fluorescent label and can be immobilized on a solid 
30 surface. 

In other embodiments the patient is imdergoing a therapeutic regimen to treat a 
disease associated with angiogenesis or the patient is suspected of having an angiogenesis- 
associated disorder. 
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In another aspect, the invention comprises an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-8, The nucleic acid molecule 
can be labeled, for example, with a fluorescent label. 

In other aspects, the invention provides an expression vector comprising an 
5 isolated nucleic acid molecule consisting of a polynucleotide sequence as shown in Tables 1- 
8 or a host cell comprising the expression vector. 

In another embodiment, the isolated nucleic acid molecule encodes a 
polypeptide having an amino acid sequence as shown in Table 8. 

In another aspect, the invention provides an isolated polypeptide which is 
10 encoded by a nucleic acid molecule having polynucleotide sequence as shown m Tables 1-8, 
In one embodiment, the isolated polypeptide has an amino acid sequence as shown in Table 
8. 

In another embodiment, flie invention provides an antibody that specifically 

binds a polypeptide that has an amino acid sequence as shown in Table 8 or which is encoded 
15 by a nucleotide sequence of Tables 1-8 . The antibody can be conjugated or fused to an 

effector component such as a fluorescent label, a toxin, or a radioisotope. In some 

embodiments, the antibody is an antibody fragment or a humanized antibody. 

In another aspect, the invention provides a method of detecting a cell 

undergoing angiogenesis in a biological sample from a patient, the method comprising 
20 contacting the biological sample with an antibody that specifically binds to a polypeptide that 

has an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide 

sequence of Tables 1-8 . In some embodiments, the antibody is further conjugated or fused to 

an effector component, for example, a fluorescent label. 

In another embodiment, the invention provides a method of detecting 
25 antibodies specific to angiogenesis in a patient, the method comprising contacting a 

biological sample from the patient with a polypeptide which is encoded by a nucleotide 

sequence of Tables 1-8. 

The invention also provides a method of identifying a compound that 

modulates the activity of an angiogenesis-associated polypeptide, the method comprising the 
30 steps of: (i) contacting the compound with a polypeptide that comprises at least 80% identity 

to an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide sequence 

of Tables 1-8; and (ii) detecting an increase or a decrease in the activity of the polypeptide. 

In one embodiment, the polypeptide has an amino acid sequence as shown in Table 8 or is a 
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polypeptide encoded by a nucleotide sequence of Tables 1-8. In another embodiment, the 
polypeptide is expressed in a cell. 

The invention also provides a method of identifying a compound that 
modulates angiogenesis, the method comprising steps of: (i) contacting the compound with a 
cell undergoing angiogenesis; and (ii) detecting an increase or a decrease in the expression of 
a polypeptide sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8. In one embodiment, the detecting step comprises 
hybridizing a nucleic acid sample from the cell with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-8. In 
another embodiment, the method further comprises detecting an increase or decrease in the 
expression of a second sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8 . 

In another embodiment, the invention provides a method of inhibiting 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or which is 80% identical to a polypeptide encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amoimt of an inhibitor of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8 . In another embodiment, the inhibitor is an 
antibody. 

In other embodiments, the invention provides a method of activating 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or at least 80% identical to a polypeptide which is encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amount of an activator of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8. 

Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 

Tables 1-8 provide nucleotide sequence of genes that exhibit changes in 
expression levels as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 

4 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of disorders associated with angiogenesis 
(sometimes referred to herein as angiogenesis disorders or AD), as well as methods for 
5 screening for compositions which modulate angiogenesis. By "disorder associated with 

angiogenesis" or "disease associated with angiogenesis" herein is meant a disease state which 
is marked by either an excess or a deficit of blood vessel development. Angiogenesis 
disorders asociated with increased angiogenesis include, but are not limited to, cancer and 
prohferative diabetic retinopathy. Pathological states for which it may be desirable to 

10 increase angiogenesis include stroke, heart disease, infertility, ulcers, wound healing, 

ischemia, and scleradoma. Solid tumors typically require angiogenesis to support or sustain 
growth, e.g., breast, colon, lung, brain, bladder, and prostate tumors. Other AD include, e.g., 
arthritis, inflammatory bowel disease, diabetis retinopathy, macular degeneration, 
atherosclerosis, and psoriasis. Also provided are methods for treating AD. 

IS Definitions 

The term "angiogenesis protein" or "angiogenesis polynucleotide" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 

20 96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an 
angiogenesis protein sequence of Table 8; (2) bind to antibodies, e.g., polyclonal antibodies, 
raised against an immunogen comprising an amino acid sequence of Table 8, and 
conservatively modified variants thereof; (3) specifically hybridize under stringent 

25 hybridization conditions to an anti-sense strand corresponding to a nucleic acid sequence of 
Tables 1-8 and conservatively modified variants thereof; (4) have a nucleic acid sequence 
that has greater than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or 
higher nucleotide sequence identity, preferably over a region of at least about 25, 50, 100, 
200, 500, 1000, or more nucleotides, to a sense sequence corresponding to one set out in 

30 Tables 1-8 . A polynucleotide or polypeptide sequence is typically fi-om a mammal 

including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. An "angiogenesis polypeptide" and an "angiogenesis 
polynucleotide," include both naturally occxuring or recombinant. 
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A "full length^' angiogenesis protein or nucleic acid refers to an agiogenesis 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type angiogenesis polynucleotide 
or polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
5 translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polj^ep tides, e.^., of an angiogenic protein. Such samples include, 
but are not limited to, tissue isolated from primates, e.g., humans, or rodents, e.g., nuce, and 
rats. Biological samples may also include sections of tissues such as biopsy and autopsy 
10 samples, and frozen sections taken for histologic pmposes. A biological sample is typically 
obtained from a eukaryotic organism, most preferably a mammal such as a primate e.g., 
chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a bird; 
reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
15 methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome histroy, will 
be particularly usefiil. 

20 The terms "identical" or percent "identity," in the context of two or more 

nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NOS: 1-229), 

25 whrai compared and aligned for maximimi correspondence over a comparison window or 
designated region) as measured using a BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e.g., NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such 
sequences are then said to be "substantially identical." This definition also refers to, or may 

30 be applied to, the compliment of a test sequence. The definition also includes sequences that 
have deletions and/or additions, as well as those that have substitutions. As described below, 
the preferred algorithms can account for gaps and the like. Preferably, identity exists over a 
region that is at least about 25 amino acids or nucleotides in length, or more preferably over a 
region that is 50-100 amino acids or nucleotides in length. 

6 
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For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithcn, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
5 program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the nxmiber of contiguous positions selected from the group consisting of from 20 

10 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequmces for comparison 
can be conducted, e.g., by the local homology algorithm of Snnth & Waterman, i4rfv. AppL 

15 Math, 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
BioL 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat 'I 
Acad. Set USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, PASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 

20 visual inspection {see, e.g.. Current Protocols in Molecular Biology (Ausubel et al., eds. 
1995 supplement)), 

A preferred example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et aL, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et al., J. 

25 MoL BioL 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 

parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Crater for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 

30 short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et aL, supra). 
These initial neighborhood word hits act as seeds for mitiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 
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far as the cumulative alignment score can be increased. Ciunulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of fhe 
5 word hits in each direction are halted when: the cumulative alignment score falls ofFby the 
quantity X from its maximum achieved value; the cxraiulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

10 uses as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M=5, N==-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (B) of 10, and the BLOSUM62 scoring matrix 
{see Henikoff & Henikoff, Proc. Natl Acad. Set USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

15 The BLAST algorithm also perfomis a statistical analysis of the similarity 

between two sequences (see, e.g., Karlin & Altschul, Proc. Nat 'I Acad. Set USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

20 nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 

25 reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules or their complements hybridize to each other imder stringent conditions, as 

30 described below. Yet another indication that two nucleic acid sequences are substantially 
identical is that the same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
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prokaryotic cells such as E. colU or eukaiyotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 

10 as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Ammo acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is 

1 5 bound to a hydrogen, a carboxyl group, an amino groiq), and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfoniimi. Such analogs have modified 
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical 
compounds that have a stmcture that is different fi-om the general chemical structure of an 

20 amino acid, but that functions in a manner similar to a naturally occurring ammo acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the lUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, hkewise, may be referred to by their commonly 
accepted single-letter codes. 

25 "Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 

30 number of fimctionally identical nucleic acids encode any given protem. For mstance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be alt^d to any of the 
corresponding codons described without altering the encoded polyp^tide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
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variations. Bvery nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
identical molecule. Accordingly, each silent variation of a nucleic acid which mcodes a 
polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another:!) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 
(E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 
(W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see. e.g., Creighton, 
Proteins (19S4)y 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g,, Alberts ei al.j Molecular Biology of the Cell (3^ ed., 1994) and Cantor and Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
**Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional stmctures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
form a compact unit of the polypeptide and are typically 25 to approximately 500 amino 
acids long. Typical domains are made up of sections of lesser organization such as stretches 
of P-sheet and a-hebces. "Tertiary structure" refers to the complete three dimensional 
structure of a polypeptide monomer. "Quaternary structure" refers to the three dimensional 
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Structure formed, usually by the noncovalent association of independent tertiary units. 
Anisotropic terms are also known as energy terms. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immimochemical, chemical, or other physical 
5 means. For example, useful labels include ^^P, flnorescmt dyes, electron-dense reagents, 
enzymes {e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 

10 is boxmd (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "efiTector" can be a variety of molecules including, for example, detection moieties 
including radioactive compoimds, fluroescent compounds, an en2yme or substrate, tags such 
as epitope tags, a toxin; a chemotherapeutic agent; a lipase; an antibiotic; or a radioisotope 

1 5 emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 

20 using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 

25 through hydrogen bond fomiation. As used herein, a probe may include natural A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not inteifere 
with hybridization. Thus, for example, probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be 

30 imderstood by one of skill in the art that probes may bind target sequences lacking complete 
complementarity with the probe sequence depending upon the stringency of the hybridization 
conditions. The probes are preferably directly labeled as with isotopes, chromOphores, 
lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin 
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complex may later bind. By assaying for the presence or absence of the probe, one can detect 
the presence or absence of the select sequence or subsequence. 

The term "recombinant" when used with reference, e,g,^ to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
5 modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived fi-om a cell so modified. Thus, for 
example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

1 0 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences fi*om unrelated genes arranged to 
make a new fimctional nucleic acid, e.g., a promoter firom one source and a coding region 

1 5 fi-om another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fiision protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

20 acid sequences near the start site of transcription, such as, in the case of a polymerase n type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs fi'om the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

25 active under environmental or developmental regulation. The term "operably linked" refers 
to a fimctional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

30 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid firagment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

12 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is iH-esent in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

10 Tij ssen, Techniques in Biochemistry and Molecular Biology-^Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, strmgent conditions are selected to be about 5-1 O^C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The T^ is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 

15 of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T^, 50% of the probes ar« occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0,01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about SO^'C for short probes (e.g., 10 to 

20 50 nucleotides) and at least about eOf'C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 

25 incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PGR, a temperature of about 36°C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32°C and 48®C 
depending on primer length. For high stringency PGR amplification, a temperature of about 
62^C is typical, although high stringency annealing temperatures can range fi-om about 50°C 

30 to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amphfications include a denaturation phase of 90°C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min,, and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high staingency ampUfication 
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reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 3TC, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 

10 background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 

The phrase "functional effects" in the context of assays for testing compounds 

1 5 that modulate activity of an angiogenesis protein includes the determination of a parameter 
that is indirectly or directly under the influence of the angiogenesis protein, e.g., a functional, 
physical, or chemical effect, such as the ability to increase or decrease angiogenesis. It 
includes binding activity, the ability of cells to proliferate, expression in cells undergoing 
angiogenesis, and other characteristics of angiogenic cells. **Functional effects" include in 

20 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compoimd that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis protein sequence, e.g., functional, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e,g,, 

25 changes in spectroscopic characteristics {e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring binding activity or binding assays, e.g. binding to antibodies, and measuring 
cellular proliferation, particularly endothelial cell proliferation, cell viabiUty, cell division 

30 especially of endothelial cells, lumen foraiation and capillary or vessel growth or formation. 
Determination of the functional effect of a compound on angiogenesis can also be performed 
using angiogenesis assays known to those of skill in the art such as an in \ntro assays, e.g., in 
vitro endothelial cell tube formation, assays, and other assays such as the chick CAM assay, 
the mouse corneal assay, and assays that assess vascularization of an implanted tumor. The 
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functional effects can be evaluated by many means known to those skilled in the art, eg., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
e,g,^ tube or blood vessel formation, measurement of changes in RNA or protein levels for 
angiogenesis-associated sequences, measurement of RNA stability, identification of 
5 downstream or reporter gene expression (CAT, luciferase, P-gal, GFP and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays, 

"Inhibitors", "activators", and "modulators" of angiogenic polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modiilating molecules 

1 0 identified using in vitro and in Wvo assays of angiogenic polynucleotide and polypeptide 
sequences. Lihibitors are compounds that, e.g., bind to, partially or totally block activity, 
decrease, prevent, delay activation, inactivate, desensitize, or down regulate the activity or 
expression of angiogenesis proteins, e.g., antagonists. "Activators" are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate 

1 5 angiogenesis protein activity. Inhibitors, activators, or modulators also include genetically 
modified versions of angiogenesis proteins, e.g., versions with altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small chemical 
molecules and the like. Such assays for inhibitors and activators include, e.g., expressing the 
angiogenic protein z;z vitro, in cells, or cell membranes, applying putative modulator 

20 compounds, and then determining the fimctional effects on activity, as described above. 
Activators and inhibitors of angiogenesis can also be identified by incubating angiogenic 
cells with the test compound and determining increases or decreases in the expression of 1 or 
more angiogenesis proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more angiogenesis 
proteins, such as angiogenesis proteins comprising the sequences set out in Table 8. 

25 Samples or assays comprising angiogenesis proteins that are treated with a 

potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine flie extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 

30 preferably 50%, more preferably 25-0%. Activation of an angiogenesis polypeptide is 

achieved when the activity value relative to the control (untreated Avith activators) is 110%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more prefisrably 1000-3000% higher. 
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"Antibody*' refers to a polypeptide comprising a framework region from an 
immimoglobulin gene or fragments th^eof that specifically binds and recognizes an antigra. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genesj, as well as the myriad immunoglobulin variable region 

5 genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in tum define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary immunoglobulin (antibody) structural unit comprises a 

10 tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (Vl) and variable heavy 
chain (Vh) refer to these li^t and heavy chains respectively. 

15 Antibodies exist, ag^., as intact immtmoglobulins or as a number of well- 

characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, 
a dimer of Fab which itself is a light chain joined to Vh-Ch1 by a disulfide bond. The F(ab)'2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 

20 thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is 

essentially Fab with part of the hinge region {see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 

25 herein, also includes antibody fragments either produced by the modification of whole 

antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (jee, e.g., McCafferty et aL^ Nature 
348:552-554 (1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 

30 antibodies, many technique known in the art can be used {see, e.g., Kohler & Milstein, 

Nature 256:495-497 (1975); Kozbor et al. Immunology Today 4: 72 (1983); Cole et al, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985); Coligan, 
Cun^ent Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual 
(1988); and Goding^ Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). 
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Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 
other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
5 Fab fragments that specifically bind to selected antigens {see^ e.g., McCafferty et al^ Nature 
348:552-554 (1990); Marks et al. Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a dififerent or altered class, effector function 
1 0 and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, dmg, «tc.; or (b) flie variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

The detailed description of the invention includes discussion of the following 
1 5 aspects of the invention: Expression of angiogenesis-associated sequences 

Informatics 

Angiogenesis-associated sequences 

Detection of angiogenesis sequence for diagnostic and 
ther^eutic applications 

20 Modulators of angiogenesis 

Methods of identifying variant angiogenesis-associated 
sequences 

Administration of pharmaceutical and vaccine compositions 
Kits for use in diagnostic and/or prognostic applications. 

25 Expression of angiogenesis-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 

30 a number of genes simultaneously allows the generation of a gene expression profile that is 
unique to the state of the cell. That is, normal tissue may be distinguished fi-om AD tissue. 
By comparing expression profiles of tissue in known different angiogenesis states, 
information regarding which genes are important (including both up- and down-regulation of 
genes) in each of these states is obtained. The identification of sequences that are 
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difTerentially expressed in angiogenic versus non-angiogenic tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate angiogenesis, and thus tumor 
growth or recurrence, in a particular patient. Similarly, diagnosis and treatment outcomes 
5 may be done or confirmed by comparing patient samples with the known expression profiles. 
Angiogenic tissue can also be analyzed to determine the stage of angiogenesis in the tissue. 
Furthermore, these gene expression profiles (or individual genes) allow screening of drug 
candidates with an eye to mimicking or altering a particular expression profile; for example, 
screening can be done for drugs that suppress the angiogenic expression profile. This may be 

10 done by making biochips comprising sets of the important angiogenesis genes, which can 
then be used in these screens. These methods can also be done on the protein basis; that is, 
protein expression levels of the angiogenic proteins can be evaluated for diagnostic purposes 
or to screen candidate agents. In addition, the angiogenic nucleic acid sequences can be 
administered for gene therapy purposes, including the administration of antisense nucleic 

1 5 acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic dmgs. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in angiogenesis, herein termed "angiogenesis sequences". As 
outlined below, angiogenesis sequences include those that are up-regulated (i.e. expressed at 

20 a higher level) in disorders associated with angiogenesis, as well as those that are down- 
regulated (i.e. expressed at a lower level). In a preferred embodiment, the angiogenesis 
sequences are from hxmians; however, as will be appreciated by those in the art, angiogenesis 
sequences from other organisms may be usefiil in animal models of disease and drag 
evaluation; thus, other angiogenesis sequences are provided, from vertebrates, including 

25 manmials, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals 
(including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences fix)m other 
organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid 
sequences. In a preferred embodiment, the angiogenesis sequences are recombinant nucleic 

30 acids. By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed 
in vitro, in general, by the manipulation of nucleic acid e.g., using polymerases and 
endonucleases, in a form not normally foimd in nature. Thus an isolated nucleic acid, in a 
linear form, or an expression vector formed in vitj-o by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
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imderstood that once a recombinant nucldc acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, te. using the in vivo cellular machinery of the 
host cell rather than iji vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
5 recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinaat 
techniques, i.e. through the expression of a recombinant nucleic acid as depicted above. A 
recombinant protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all 

10 of the proteins and compounds with which it is normally associated in its wild type host, and 
thus may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 

1 5 weight of the total protein, with at least about 80% being preferred, and at least about 90% 

being particularly preferred. The definition includes the production of an angiogenesis protein 
from one organism in a different organism or host cell. Altematively, the protein may be 
made at a significantly higher concentration than is noraially seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 

20 concentration levels. Altematively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 
deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As 
will be appreciated by those in the art and is more fiilly outlined below, angiogenesis 

25 sequences are usefiil in a variety of applications, including diagnostic applications, which will 
detect naturally occuixing nucleic acids, as well as screening applications; for example, 
biochips comprising nucleic acid probes to the angiogenesis sequence can be generated. In 
the broadest sense, then, by **nucleic acid" or "oligonucleotide" or grammatical equivalents 
herein means at least two nucleotides covalently linked together. A nucleic acid of the 

30 present invention will generally contain phosphodiester bonds, although in some cases, 
nucleic acid analogs are included that may have alternate backbones, comprising, for 
example, phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other 
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analog nucleic acids include those with positive backbones; non-ionic backbones, and non- 
ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "'Carbohydrate Modifications in 
Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
5 more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. 

As will be appreciated by those in the art, nucleic acid analogs may find use in 

10 the present invention. In addition, mixtures of naturally occurring nucleic acids and analogs 
can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of 
naturally, occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

1 5 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the meltmg temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 

20 due to their non-ionic nature, hybridization of the bases attached to these backbones is 

relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 

25 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

30 xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term **nucleoside" 
includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such 
as amino modified nucleosides. In addition, "nucleoside" includes non-naturally occurring 
analog structures. Thus for example the individual units of a peptide nucleic acid, each 
containing a base, are referred to herein as a nucleoside. 

20 
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• If 

An angiogenesis sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the angiogenesis sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
5 conditions. 

For identifying angiogenesis-associated sequences, the angiogenesis screen 
typically includes comparing genes identified in a modification of an in vitro model of 
angiogenesis as described in Hiraoka, Cell 95:365 (1998) with genes identified in controls. 
Samples of normal tissue and tissue imdergoing angiogenesis are applied to biochips 

1 0 comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, for example fi-om Affymetrix. Gene expression profiles as described 
herein are generated and the data analyzed. 

In a preferred embodiment, the genes showing changes in expression as 

1 5 between normal and disease states are compared to genes expressed in other normal tissues, 
including, but not limited to lung, heart, brain, liver, breast, kidney, muscle, prostate, small 
intestine, large intestine, spleen, bone and placenta. In a preferred embodiment, those genes 
identified during the angiogenesis screen that are expressed in any significant amount in other 
tissues are removed fi-om the profile, although in some embodiments, this is not necessary. 

20 That is, when screening for drugs, it is usually preferable that the target be disease specific, to 
minimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are tq)- 
regulated in angiogenesis disorders; that is, the expression of tiiese genes is higher in tiie 
disease tissue as compared to normal tissue. *TJp-regulation" as used herein means at least 

25 about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All accession numbers herein are for the GenBank 
sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is known in the art, see, eg:, Benson, DA, et al.. 
Nucleic Acids Research 26:1-7 (1998) and http.7/www.ncbi.nlm.nih.gov/. Sequences are also 

30 avialable in other databases, e.g., European Molecular Biology Laboratory (EMBL) and 
DNA Database of Japan (DDBJ). In addition, most preferred genes were found to be 
expressed in a limited amount or not at all in heart, brain, limg, liver, breast, kidney, prostate, 
small intestine and spleen. 
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In another preferred embodiment, angiogenesis sequences are those that are 
down-regulated in the angiogenesis disorder; that is, the expression of these genes is lower in 
angiogenic tissue as compared to normal tissue. '"Down-regulation*' as used herein means at 
least about a two-fold change, preferably at least about a three fold change, with at least about 

5 five-fold or higher being preferred. 

Angiogenesis sequences according to the invention may be classified into 
discrete clusters of sequences based on common expression profiles of the sequences. 
Expression levels of angiogenesis sequences may increase or decrease as a fimction of time in 
a manner that correlates with the induction of angiogenesis. Alternatively, expression levels 

10 of angiogenesis sequences may both increase and decrease as a fimction of time. For 
example, expression levels of some angiogenesis sequences are temporarily induced or 
diminished during the switch to the angiogenesis phenotype, followed by a return to baseline 
expression levels. Tables 1-8 provides genes, the mRNA expression of which varies as a 
fimction of time in angiogenesis tissue when compared to normal tissue. 

15 In a particularly preferred embodiment, angiogenesis sequences are those that 

are induced for a period of time, typically by positive angiogenic factors, followed by a return 
to the baseline levels. Sequences that are temporarily induced provide a means to target 
angiogenesis tissue, for example neovascularized tumors, at a particular stage of 
angiogenesis, while avoiding rapidly growing tissue that require perpetual vascularization. . 

20 Such positive angiogenic factors include aFGF, PFGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are fiirther categorized with respect to 
the timing of induction. For example, some angiogenesis genes may be induced at an early 
time period, such as within 10 minutes of the induction of angiogenesis. Others may be 
induced later, such as between 5 and 60 minutes, while yet others may be induced for a time 

25 period of about two hours or more followed by a return to baseline expression levels. 

In another prefenred embodiment are angiogenesis sequences that are inhibited 
or reduced as a fimction of time followed by a return to '^normal" expression levels. 
Inhibitors of angiogenesis are examples of molecules that have this expression profile. These 
sequences also can be firrther divided into groups depending on the timing of diminished 

30 expression. For example, some molecules may display reduced expression within 10 minutes 
of the induction of angiogenesis. Others may be diminished later, such as between 5 and 60 
minutes, while others may be diminished for a time period of about two hours or more 
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followed by a return to baseline. Examples of such negative angiogenic factors include 
thrombospondin and endostatin to name a few. 

In yet another preferred embodiment are angiogenesis sequences that are 
induced for prolonged periods. These sequences are typically associated with induction of 
5 angiogenesis and may participate in induction and/or maintenance of the angiogenesis 
phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression 
of which is reduced or diminished for prolonged periods in angiogenic tissue. These 
sequences are typically angiogenesis inhibitors and their diminution is correlated with an 
1 0 increase in angiogenesis. 

Informatics 

The ability to identify genes that undergo changes in expression with time 
during angiogenesis can additionally provide high-resolution, high-sensitivity datasets which 

15 can be used in the areas of diagnostics, therapeutics, drug development, biosensor 

development, and other related areas. For example, the expression profiles can be used in 
diagnostic or prognostic evaluation of patients with angiogenesis-associated disease. Or as 
another example, subcellular toxicological information can be generated to better direct drug 
structure and activity correlation {see^ Anderson, L., "Pharmaceutical Proteomics: Targets, 

20 Mechanism, and Function," paper presented at the IBC Proteomics conference, Coronado, 
CA (June 11-12, 1998)). Subcellular toxicological information can also be utilized in a 
biological sensor device to predict the likely toxicological effect of chemical exposures and 
likely tolerable exposure thresholds {see, U.S. Patent No. 5,81 1,231). Similar advantages 
accme from datasets relevant to other biomolecules and bioactive agents nucleic acids, 

25 saccharides, lipids, drags, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of data assay data. The data contained in the database is acquired , 
e.^., using array analysis either singly or in a library format. The database can be in 
substantially any form in which data can be maintained and transmitted, but is preferably an 

30 electronic database. The electronic database of the invention can be maintained on any 

electronic device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area.network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
5 and/or absolute abimdance of a variety of molecular and macromolecular species from a 

biological sample undergoing angiogenesis, i.e., the identification of angiogenesis-associated 
sequences described herein, provide an abxmdance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, ther^eutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 

10 status, among others. Although the data generated firom the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 

15 database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein fimction 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 

20 for obtaining full-length sequences from the collection of partial length sequences. U.S. 

Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 

25 in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a frmctionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 

30 Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 
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The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g.y 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 
5 In an exemplary embodiment, at least one of the sources of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for angiogenesis. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

10 target species in a sample: (1) a unique identification code, which can include, e,g.y a target 
molecular stmcture and/or characteristic separation coordinate {e.g,^ electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target q>ecies 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

15 target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable mediimi or as an array of charge states or 

20 transistor gate states, such as an array of cells in a DRAM device (eg^., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising imique 
identifiers for at least 10 target data records cross-tabulated with target source. 

25 When the target is a peptide or nucleic acid, the invention preferably provides 

a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucl^c acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 

30 embodiment thereof {e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amoxmt of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible jpOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other foraiat 



wo 02/079492 PCT/US02/04915 

/ 

Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data fi-om an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethemet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal tranmission medimn, 
whereby at least one network device {e.g.^ computer, disk array, etc.) comprises a pattem of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

10 cells) composing a bit pattem encoding data acquired firom an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic commimications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattem encoding data from an assay or a 

1 5 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

20 initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
firom the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 

25 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.)\ a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.)\ an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
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an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
5 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

10 A ngiogenesis-associated sequences 

Angiogenesis proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment,the 
angiogenesis protein is an intracellular protein. Intracellular proteins may be foxmd in the 
cytoplasm and/or in the nucleus or associated with the intracellular side of the plasma 

1 5 membrane. Intracellular proteins are involved in all aspects of cellular function and 

replication (including, e,g,^ signaling pathways); aberrant expression of such proteins often 
results in unregulated or disregulated cellular processes (see, e.g.. Molecular Biology of the 
Cell, 3rd Edition, Alberts, Ed., Garland Pub., 1994). For example, many intracellular 
proteins have en2ymatic activity such as protein kinase activity, protein phosphatase activity, 

20 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 

25 in the proteins of one or more motifs for which defined functions have been attributed. In 

addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct fi-om SH2 

30 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricbpeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phosphohpids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
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sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

In another embodiment, the angiogenesis sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
5 They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those akeady described 
for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
10 kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tjrrosine kinases, certain cj^okine receptors, receptor 
1 S guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
20 transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed or flanked by charged amino acids. Therefore, upon analysis of the amino 
acid sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/)- 

The extracellular domains of transmembrane proteins are diverse; however, 
25 conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
30 For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-ceU interactions. Cell- 
associated Ugands can be tethered to the cell for example via a glycosylphosphatidylinositol 
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(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also 
associate with the extracellular matrix and contribute to the maintenance of the cell stmcture. 

Angiogenesis proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
5 described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide acess to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 

10 be made soluble by removing transmembrane sequences, for example through recombinant 
methods. Furthermore, transmembrane proteins that have been made soluble can be made to 
be secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the angiogenesis proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 

1 5 peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

proteins are involved in nxmierous physiological events; by virtue of their circulating nature, 
they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 

20 on cells at a distance). Thus secreted molecules find use in modulating or altering nimierous 
aspects of physiology. Angiogenesis proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, eg-., 
for blood or serum tests. 

An angiogenesis sequence is typically initially identified by substantial nucleic 

25 acid and/or amino acid sequence homology or linkage to tiie angiogenesis sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

30 As detailed in the definitions, percent identity can be determined using an 

algorithm such as BLAST. A preferred method utilizes the BLASTN module of WU- 
BLAST-2 set to the default parameters, with overlap span and overlap fi"action set to 1 and 
0.125, respectively. The alignment may include the introduction of g^s in the sequences to 
be aligned. In addition, for sequences which contain either more or fewer nucleotides than 
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those of the nucleic acids of the figures, it is understood that the percentage of homology will 
he determined hased on the number of homologous nucleosides in relation to the total number 
of nucleosides. Thus, for example, homology of sequences shorter than those of the 
sequences identified herein and as discussed below, will be determined using the number of 
S nucleosides in the shorter sequence. 

Di one embodiment, the nucleic acid homology is determined through 
hybridization studies. Thus, e.g,, nucleic acids which hybridize imder high stringency to a 
nucleic acid of Tables 1-8 , or its complement, or is also found on naturally occurring 
mRNAs is considered an angiogenesis sequence. In another embodiment, less stringent 

lb hybridization conditions are used; for example, moderate or low stringency conditions may 
be used, as are known in the art; see Ausubel, supra, and Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention, e.g, the 
sequence in Tables 1-8 , are firagments of larger genes, i.e. they are nucleic acid segments. 
'"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 

1 S and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 

sequences provided herein, extended sequences, in either direction, of the angiogenesis genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the fiiU length sequences; see Ausubel, et aL, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences, e.g., systems such as 

20 UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire angiogenesis nucleic acid 
coding regions or the entire mRNA sequence. Once isolated fi-om its natural source, e.g., 
contained within a plasmid or other vector or excised therefi-om as a linear nucleic acid 

25 segment, the recombinant angiogenesis nucleic acid can be fiirther-used as a probe to identify 
and isolate other angiogenesis nucleic acids, for example extended coding regions. It can 
also be used as a "precursor" nucleic acid to make modified or variant angiogenesis nucleic 
acids and proteins. 

The angiogenesis nucleic acids of the present invention are used in several 
30 ways. In a first embodiment, nucleic acid probes to the angiogenesis nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, for example for gene therapy, vaccine, and/or antisense applications. 
Alternatively, the angiogenesis nucleic acids that include coding regions of angiogenesis 
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proteins can be put into expression vectors for the expression of angiogenesis proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to angiogenesis nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complem^ts thereof) are 
5 made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the angiogenesis nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, for example in sandwich assays), such 
that hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 

10 pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 

15 sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 

20 firom about 8 to about 100 bases long, with fi-om about 10 to about 80 bases being preferred, 
and fi-om about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
25 either overlapping probes or probes to different sections of the target being used. That is, 

two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e. have some sequence in consmon), 
or separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
30 immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable tmder the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and granamatical univalents herein is meant Mie or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 

5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules, hnmobilization may also involve a combination of 
covalent and non-covalent interactions. 

10 Li general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 

1 5 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or fimctionalized glass, plastics (including acrylics, polystyrene and 

20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or siUca- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluorescese. A preferred substrate is described in copending application entitled Reusable 

25 Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 
1999, herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be ^predated by those in 
the art, other configurations of substmtes may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 

30 sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, for 
example, the biochip is derivatized with a chemical fimctional group including, but not 
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limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groiq>s 
being particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, for example using linkers as are known in the 
5 art; for example, homo-or hetero-bifimctional linkers as are well known (see 1994 Pierce 
Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference). In addition, in some cases, additional linkers, such as alkyl groups 
(including substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 

1 0 and then attached to the surface of the solid support. As wiU be ^predated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 

1 5 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 

20 in WO 95/25 1 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of angiogenesis-associated sequences. These assays are typically performed in 

25 conjunction with reverse transcription. In such assays, an angiogenesis-associated nucleic 
acid sequence acts as a template in an amplification reaction (e.g.. Polymerase Chain 
Reaction, or PCR). In a quantitative amplification, the amount of amplification product will 
be proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of angiogenesis-associated RNA Methods of 

30 quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g., in Imiis et al (1990) PCR Protocols, A Guide to Methods 
and Applications^ Academic Press, Inc. N.Y.). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
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dye and a 3' quenching agent. The probe hybridizes to a PGR product, but cannot itself be 
extended due to a blocking agent at the 3 * end. When the PGR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
5 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, for example, literature provided by Perkin-Ehner, e.g., www2.perkin- 
ehner.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see, Wu and Wallace (1989) Genomics 4; 560, Landegren et al. (1988) 
10 Science 241 : 1077, and Barringer et al. (1990) Gene 89: 1 17), transcription amplification 

(Kwoh et al. (19S9) Proc. Natl. Acad. Set USA 86: 1173), self-sustained sequence replication 
(Guatelli et al. (1990) Proc. Nat. Acad. ScL USA 87: 1874), dot PGR, and linker adapter PGR, 
etc. 

In a preferred embodiment, angiogenesis nucleic acids, e.g.^ encoding 

1 5 angiogenesis proteins are used to make a variety of expression vectors to express 

angiogenesis proteins which can then be used in screening assays, as described below. 
Expression vectors and recombinant DNA technology are well known to those of skill in the 
art (see, e.g., Ausubel, supra, and Gene Expression Systems, Fernandez & Hoeffler, Eds, 
Academic Press, 1999) and are used to express proteins. The expression vectors may be 

20 either self-replicating extrachromosomal vectors or vectors which integrate into a host 
genome. Generally, these expression vectors include transcription^ and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the angiogenesis protein. 
The term "control sequences" refers to DNA sequences used for the expression of an 
operably linked coding sequence in a particular host organism. Gontrol sequences that are 

25 suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, 
and a ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 

30 secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
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and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
5 nucleic acid will generally be appropriate to the host cell used to express the angiogenesis 
protein; for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus are preferably used to express the angiogenesis protein in Bacillus. Nimierous types 
of appropriate expression vectors, and suitable regulatory sequences are known in the art for 
a variety of host cells. 

10 In general, transcriptional and translational regulatory sequences may include, 

but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

1 5 PromotCT sequences encode either constitutive or inducible promoters. The 

promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 

20 example, the expression vector may have two replication systems, thus allowing it to be 

maintained in two organisms, for example in mammalian or insect cells for egression and in 
a procaryotic host for cloning and amplification. Fmlhennore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression constmct. 

25 The integrating vector may be directed to a specific locus in the host cell by selecting flie 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.^., Fernandez & Hoefifler, supra). See also Kitamura, et 
al. (1995) PNAS 92:9146-9150. 

In addition, in a preferred embodiment, the expression vector contains a 

30 selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding an 
angiogenesis protein, under the appropriate conditions to induce or cause expression of the 
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angiogenesis protein. Conditions appropriate for angiogenesis protein expression will vary 
with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation or optimization. For example, the use 
of constitutive promoters in the expression vector will require optimizing the growth and 
5 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
- growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
10 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeaists, E. coli, BaciUus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THPl cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in 
manmialian cells. MammaUan expression systems are also known in the art, and include 
retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
vims LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription temiination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3' to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription temiinator and polyadenlytion signals include 
those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, angiogenesis proteins are expressed in bact^al 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
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and hybrid promoters are also useful; for example, the tac promoter is a hybrid of the tip and 
lac promoter sequences. Furttiermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the abihty to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 
5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the angiogenesis protein in bacteria. The protein is either 
secreted into the grow^ media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bactmal 
expression vector may also include a selectable marker gene to allow for the selection of 

10 bacterial strains that have been transformed. Suitable selection genes include genes which 
render the bacteria resistant to dmgs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 

15 known in the art, and include vectors for Bacillus subtilis, E, coli, Sti-eptococcus a^emoris^ 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. 

20 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, angiogenesis protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C maltosa, Hansenula polyniorpha, 

25 Kluy\'eromyces fragilis and K. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The angiogenesis protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, for example, for the creation of monoclonal 
antibodies, if the desired epitope is small, the angiogenesis protein may be fused to a carrier 

30 protein to form an immunogen. Alternatively, the angiogenesis protein may be made as a 
fusion protein to increase expression, or for other reasons. For example, when the 
angiogenesis protein is an angiogenesis peptide, the nucleic acid encoding the peptide may be 
linked to another nucleic acid for expression purposes. Fusion with detection q)itope tags 
can be made, e.g., with FLAG, His 6, myc, HA, etc. 
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In one embodiment^ the angiogenesis nucleic acids, proteins and antibodies of 
the invention are labeled. By "labeled" herein is meant that a compound has at least one 
element, isotope or chemical compound attached to enable the detection of the compound. In 
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
5 isotopes; b) immime labels, which may be antibodies, antigens, or epitope tags and c) colored 
or fluorescent dyes. The labels may be incorporated into the angiogenesis nucleic acids, 
proteins and antibodies at any position. For example, the label should be capable of 
producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 
radioisotope, such as ^H, ^"^C, ^^P, ^^S, or a fluorescent or chemiluminescent compound, 

1 0 such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for 
conjugating the antibody to the label may be employed, including those methods described by 
Hunter et al.. Nature, 144:945 (1962); David et al.. Biochemistry, 13:1014 (1974); Pain et 
al., J. Immunol. Meth., 40:219 (1981); andNygren, J. Histochem. and Cytochem., 30:407 

15 (1982). 

Accordingly, the present invention also provides angiogenesis protein 
sequences. An angiogenesis protein of the present invention may be identified in several 
ways. "Protein" in this sense includes proteins, polypeptides, and peptides. As will be 
appreciated by those in the art, the nucleic acid sequences of the invention can be used to 

20 generate protein sequences. There are a variety of ways to do this, including cloning the 
entire gene and verifying its fi*ame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a frame, assuming the angiogenesis protein has 
an identifiable motif or homology to some protein in the database being used. Generally, the 
nucleic acid sequences are input into a program that will search all three fi-ames for 

25 homology. This is done in a preferred embodiment using the following NCBI Advanced 

BLAST parameters. The program is blastx or blastn. The database is nr. The input data is as 
"Sequence in FASTA formaf The organism Ust is **none". The "expect" is 10; the filter is 
default. The "descriptions" is 500, the "aUgnments" is 500, and the "alignment view" is 
pairwise. The "Query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap 

30 existence cost is 1 1, per residue gap cost is 1; and the lambda ratio is .85 default. This 
results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid 
variants of the naturally occurring sequences, as determined herein. Preferably, the variants 
are preferably greater than about 75% homologous to the wild-type sequence, more 
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preferably greater than about 80%, even more preferably greater flian about 85% and most 
preferably greatw than 90%. In some embodiments the homology will be as higjh as about 93 
to 95 or 98%, As for nucleic acids, homology in this context means sequence similarity or 
identity, with identity being prefenred. This homology will be determined usmg standard 
5 techniques well known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than 
the wild type amino acid sequences. Thus, in a preferred embodiment, included within the 
definition of angiogenesis proteins are portions or fragments of the wild type sequences, 
herein. In addition, as outlined above, the angiogenesis nucleic acids of the invention may be 

10 used to obtain additional coding regions, and thus additional protein sequence, using 
techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant 
angiogenesis proteins as compared to the wild-type sequence. That is, as outlined more fiUly 
below, the derivative angiogenesis peptide will often contain at least one amino acid 

15 substitution, deletion or insertion, with amino acid substitutions being particularly preferred. 
The amino acid substitution, insertion or deletion may occur at any residue within the 
angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 

20 three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the angiogenesis 
protein, using cassette or PGR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant angiogenesis protein fragments having up to 

25 about 1 00-1 50 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected 

30 which have modified characteristics as will be more fiiUy outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed angiogenesis variants screened for 

39 



BNSOOCID: <WO 02079492A2_I_> 



wo 02/079492 PCTAJS02/04915 

the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, for example. Ml 3 
primer mutagenesis and PGR mutagenesis. Screening of the mutants is done using assays of 
angiogenesis protein activities. 
5 Amino acid substitutions are typically of single residues; insertions usually 

will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 
some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 

10 arrive at a final derivative. Generally these changes are done on a few amino acids to 

minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the angiogenesis protein are 
desired, substitutions are generally made in accordance with the amino acid substitution chart 
provided in the definition section. 

15 Substantial changes in function or immunological identity are made by 

selecting substitutions that are less conservative than those provided in the definition of 
"conservative substitution". For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for 
example the alpha-helical or beta-sheet stmcture; the charge or hydrophobicity of the 

20 molecule at the target site; or the bulk of the side chain. The substitutions which in general 
are expected to produce tlie greatest changes in the polypeptide's properties are those in 
which (a) a hydrophiUc residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 

25 e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 

30 selected to modify the characteristics of the angiogenesis proteins as needed. Altematively, 
the variant may be designed such that the biological activity of the angiogenesis protein is 
altered. For example, glycosylation sites may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 

40 



BNSOOCID: <WO 02079492A2 I > 



wo 02/079492 PCT/US02/04915 

acid residues of an angiogenesis polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N-or C-terminal residues of an 
angiogenesis. polypeptide. Derivatization with bifunctional agents is useful, for instance, for 
crosslinking angiogenesis polypeptides to a water-insoluble support matrix or surface for use 
5 in the method for purifying anti-angiogenesis polypeptide antibodies or screening assays, as 
is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosalicylic acid, homobifimctional imidoesters, including disuccinimidyl 
esters such as 3,3 -dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

10 maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains [T.E. 

1 5 Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 

Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C- 
terminal carboxyl group. 

Another type of covalent modiiScation of the angiogenesis polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 

20 of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein 
to mean deleting one or more carbohydrate moieties found in native sequence angiogenesis 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence angiogenesis polypeptide. Glycosylation patterns can be altered in many ways. For 
example the use of different cell types to express angiogenesis-associated sequences can 

25 result in diffeent glycosylation patterns- 
Addition of glycosylation sites to angiogenesis polypeptides may also be 
accomplished by altering tiie amino acid sequence thereof The alteration may be made, for 
example, by the addition of, or substitution by, one or more serine or threonine residues to the 
native sequence angiogenesis polypeptide (for O-linked glycosylation sites). The 

30 angiogenesis amino acid sequence may optionally be altered through changes at the DNA 

level, particularly by mutating the DNA encoding the angiogenesis polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the 
angiogenesis polypeptide is by chemical or enzymatic coupling of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330. published 1 1 
September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 

5 Removal of carbohydrate moieties present on the angiogenesis polypeptide 

may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al,. Arch, Biochem. Biophys., 259:52 (1987) and by Edge et ah, Anal. Biochem., 118:131 

10 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al., Meth. 
EnzymoL, 138:350 (1987). 

Another type of covalent modification of angiogenesis comprises linking the 
angiogenesis polypeptide to one of a variety of nonproteinaceous polymers, e.g., 

1 5 polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Angiogenesis polypeptides of flie present invention may also be modified in a 
way to form chimeric molecules comprising an angiogenesis polypeptide fiised to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of an angiogenesis polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the angiogenesis polypeptide. The 
presence of such epitope-tagged forms of an angiogenesis polypeptide can be detected using 
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 

25 angiogenesis polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fiision of an angiogenesis polypeptide 
with an immunoglobulin or a particular region of an immimoglobulin. For a bivalent form of 
the chimeric molecule, such a fiision could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known in the 

art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 [Field et 
al, Mol Cell Biol, 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto [Evan et al. Molecular and Cellular Biology^ 5:3610-3616 (1985)]; 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et aL, 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide 
[HoppetaL BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martm^fa/., 
Science, 255:192-194 (1992)]; tubulin epitope peptide [Skinner et al, J. Biol Chem., 
5 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyennuth et aL, 
Proc. Natl Acad. Set USA, 87:6393-6397 (1990)]. 

Also included with an embodiment of angiogenesis protein are other 
angiogenesis proteins of the angiogenesis family, and angiogenesis proteins from other 
organisms, which are cloned and expressed as outlined below. Thus, probe or degen^ate 

10 polymerase chain reaction (PGR) primer sequences may be used to find other related 

angiogenesis proteins from humans or other organisms. As will be appreciated by those in 
the art, particularly usefiil probe and/or PGR primer sequences include the unique areas of the 
angiogenesis nucleic acid sequence. As is generally known in the art, preferred PGR primers 
are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being 

15 preferred, and may contain inosine as needed. The conditions for the PGR reaction are well 
known in the art (e.^., Innis, PGR Protocols, supra). 

In addition, as is outlined herein, angiogenesis proteins can be made that are 
longer than those encoded by the nucleic acids of the figures, eg^., by the elucidation of 
extended sequences, the addition of epitope or purification tags, the addition of other fiision 

20 sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by 
angiogenesis nucleic acids. Thus, angiogenesis proteins are encoded by nucleic acids that 
will hybridize to the sequences of the sequence listings, or their complements, as outlined 
herein. 

25 In a preferred embodiment, when the angiogenesis protein is to be used to 

generate antibodies, e,g,, for immunotherapy or immunodiagnosis, the angiogenesis protein 
should share at least one epitope or determinant with the fiill length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHG. Thus, in most instances, antibodies 

30 made to a smalls angiogenesis protein will be able to bind to the fiiU-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a imique epitope show little or no cross-reactivity. In a preferred 
embodiment, the epitope is selected from a protein sequence set out in Table 8. 
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Methods of preparing polyclonal antibodies are known to the skilled artisan 
ie.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immimizing agent and/or adjuvant will be injected in the mammal by multiple 
5 subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fi-agment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being inmiunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet heniocyanin, serum albumin, bovine thyroglobulin, and soybean 

10 trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immimization protocol may be selected by one skilled in the art 
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

1 S antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an inununizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Altematively, the lymphocytes may be immunized in vitro. The 

20 immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
8 , or firagment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 

25 glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, 
Academic Press, (1986) pp. 59-103]. hnmortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, 
rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 

30 growth or survival of the imfused, immortalized cells. For example, if the parental cells lack 
the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine 
("HAT medixun")> which substances prevent the growth of HGPRT-deficient cells. 
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In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
5 protein encoded by a nucleic acid Tables 1-8 or a firagment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to angiogenesis protein are capable 

10 of reducing or eliminating a biological fimction of an angiogenesis protein, as is described 
below. That is, the addition of anti-angiogenesis protein antibodies (either polyclonal or 
preferably monoclonal) to angiogenic tissue (or cells containing angiogenesis) may reduce or 
eliminate the angiogenesis activity. Generally, at least a 25% decrease in activity is 
preferred, with at least about 50% being particularly preferred and about a 95-100% decrease 

1 5 being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are 
humanized antibodies {e,g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs,Inc.) Himianized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immimoglobulin chains or fragments thereof (such as Fv, 

20 Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

minimal sequence derived from non-himian immunoglobulin. Hmnanized antibodies include 
human immunoglobulins (recipient antibody) in which residues form a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 

25 affinity and capacity. In some instances, Fv framework residues of the human 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or fi-amework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 

30 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 



45 



3NSOOCID: <WO 



.02079492A2 I > 



wo 02/079492 PCT/US02/04915 

immunoglobulin [Jones et al.. Nature, 321:522-525 (1986); Riechmann et al.. Nature, 
332:323-329 (1988); and Presta, Curt. Op. Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it from 
a source which is non-human. These non-hxmian amino acid residues are often referred to as 
import residues, which are typically taken from an import variable domain. Humanization 
can be essentially performed following the method of Winter and co-workers [Jones et al.. 
Nature, 321:522-525 (19S6); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et 
al.. Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherem substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. In practice, hiunanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et aL, J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boemer et al. are also available for the preparation of human monoclonal antibodies (Cole et . 
al.. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p, 77 (1985) and Boemer et . 
al., J. Immunol., 147(l):86-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immimoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immimoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the foUowmg scientific publications: 
Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al.. Nature 368 856-859 (1994); 
Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 
(1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. 
Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised 
against angiogenesis proteins. As used herein, immxmotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
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(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
5 desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the angiogenesis proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 

10 antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted angiogenesis protein. 

In another preferred embodiment, the angiogenesis protein to which antibodies 
are raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the angiogenesis protein and prevent it from 

15 binding to other proteins, such as circulating ligands or cell-associated molecules. The 

antibody may cause down-regulation of the transmembrane angiogenesis protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
angiogenesis protein. The antibody is also an antagonist of the angiogenesis protein. 

20 Further, the antibody prevents activation of the transmembrane angiogenesis protein. In one 
aspect, when the antibody prevents the binding of other molecules to the angiogenesis 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-P, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 

25 methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, angiogenesis is 
treated by administering to a patient antibodies directed against the transmembrane 
angiogenesis protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

30 otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated or frised to an 
effector moiety. The effector moiety can be any number of molecutes, including labelling 
moieties such as radioactive labels or fluorescent labels, or can be a thers^eutic moiety. In 
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one aspect the therapeutic moiety is a small molecule that modulates the activity of the 
angiogenesis protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the angiogenesis protein. The therapeutic 
moiety may inhibit enzymatic activity such as protease or coUagenase activity associated witii 

5 angiogenesis, or be an attractant of other cells, such as NK cells. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to angiogenesis tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with 
angiogenesis. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic dmgs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against angiogenesis 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

IS the antibody. Targeting die therapeutic moiety to transmembrane angiogenesis proteins not 
only serves to increase the local concentration of therapeutic moiety in the angiogenesis 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the angiogenesis protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
or fused to a protein which facilitates entry into the cell. In one case, the antibody enters the 
cell by endocytosis. In another embodiment, a nucleic acid encoding the antibody is 
administered to the individual or cell. Moreover, wherein the angiogenesis protein can be 
targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that target 

25 localization, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
of at least about 0.1 mM, more usually at least about 1 klM, preferably at least about 0.1 
or better, and most preferably, 0.01 nM or better. Selectivity of binding is also 

30 important. 

In a preferred embodiment, the angiogenesis protein is purified or isolated 
after expression. Angiogenesis proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
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and chromatographic techniques, including ion exchange, hydrophobic, affinity, and revise- 
phase HPLC chromatography, and chromatofocusing. For example, the angiogenesis protein 
may be purified using a standard anti-angiogenesis protein antibody colimm. Ultrafiltration 
and diafiltration techniques, in conjxmction with protein concentration, are also usefiil. For 
5 general guidance in suitable purification techniques, see Scopes, R., Protein Purification, 
Springer- Verlag, NY (1982). The degree of purification necessary will vary depending on 
the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and 
nucleic acids are useflil in a nmnber of applications. They may be used as immunoselection 
10 reagents, as vaccine reagents, as screening agents, etc. 

Detection of angiogenesis sequence for diagnostic and therapeutic applications 

In one aspect, the RNAexpression levels of genes are detemiined for different 
cellular states in the angiogenesis phenotype. Expression levels of genes in normal tissue 

15 (z.e., not undergoing angiogenesis) and in angiogenesis tissue (and in some cases, for varjdng 
severities of angiogenesis that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 

20 generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whetiier a tissue 
sample has the gene expression profile of normal or angiogenesic tissue. This will provide 

25 for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 

30 normal versus angiogenic tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more statese. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both- Alternatively, the difference in expressicm may be quantitative, ^.g., in that 
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expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
5 GeneChip™ expression arrays, Lockhart, Nature Biotechnology, 14:1 675- 1 680 ( 1 996), 

hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PGR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 

10 preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
• of the gene transcript, and the quantification of gene expression levels, or, altanatively, the 
fmal gene product itself (protein) can be monitored, e.g.^ with antibodies to the angiogenesis 

1 5 protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
angiogenesis genes, i.e., those identified as being important in an angiogenesis phenotype, 
can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 

20 simultaneously on a nxmiber of genes. Multiple protein expression monitoring can be 

performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. The assays are further described below in the example. PGR techniques can 

25. be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the angiogenesis protein are 
detected. Although DNA or RNA encoding the angiogenesis protein may be detected, of 
particular interest are methods wherein an mRNA encoding an angiogenesis protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

30 complementary to and hybridizes with the mRNA and includes, but is not limited to, 

oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
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detected. In another method detection ofthemRNA is performed in sit^^ In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example a 
5 digoxygenin labeled riboprobe (RNA probe) that is complementary to the mJElNA encoding 
an angiogenesis protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed wdth nitro blue tetrazoUum and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
10 as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
1 5 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, angiogenesis proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of angiogenesis. 
Detection of these proteins in putative angiogenesis tissue allows for detection or diagnosis of 

20 angiogenesis. In one embodiment, antibodies are used to detect angiogenesis proteins. A 
preferred method separates proteins fi-om a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the Uke). Following separation of proteins, the angiogenesis protein is 
detected, e.^., by immimoblotting with antibodies raised against the angiogenesis protein. 

25 Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred mefliod, antibodies to the angiogenesis protein find use in 
in situ imagmg techniques, e.g., in histology (e.^.. Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the angiogenesis protein(s). Following washing to remove non-specific 

30 antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
label. In another method the primary antibody to the angiogenesis protein(s) contains a 
detectable label, for example an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
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detectable label. This method finds particular use in simultaneous screening for a plurality of 
angiogenesis proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are alsoprovided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
5 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing 
angiogenesis firom biological samples, such as blood, urine, sputum, or other bodily fluids. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. 

10 Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted angiogenesis proteins. Antibodies can be used to detect an angiogenesis protein by 
previously described immunoassay techniques including ELISA, immunoblotting (Western 
blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the presence 
of antibodies may indicate an immune response against an endogenous angiogenesis protein. 

15 In a preferred embodiment, in situ hybridization of labeled angiogenesis 

nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
angiogenesis tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 

20 findings. It is fiirther imderstood that the genes which indicate the diagnosis may differ firom 
those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refiractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
25 acids, modified proteins and cells containing angiogenesis sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to angiogenesis 
severity, in terms of long term prognosis. Again, this may be done on either a protein or gene 
level, with the use of genes being preferred. As above, angiogenesis probes may be attached 
to biochips for the detection and quantification of angiogenesis sequences in a tissue or 
30 patient. The assays proceed as outlined above for diagnosis. PGR method may provide more 
sensitive and accurate quantification. 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The angiogenesis proteins, antibodies, 
nucleic acids, modified proteins and cells containing angiogenesis sequences are used in drug 
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screening assays or by evaluating the effect of drug candidates on a "gene expression profile" 
or expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
5 Zlokamik, et al., Science 279, 84-8 (1998); Held, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified angiogenesis proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the angiogenesis phenotype or an identified physiological 

1 0 function of an angiogenesis protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drag candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably m conjunction with high throu^put 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokamik, supra. 

15 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in angiogenesis, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
angiogenic protein. "Modulation" thus includes both an increase and a decrease in gene 

20 expression. The prefeired amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing angiogenesis, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodunents 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in angiogenic tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in 

25 angiogenic tissue compared to normal tissue often provides a target value of a 10-fold 
increase in expression to be induced by tiie test compoimd. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the angiogenesis protein and standard 

30 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entitites, z.e., an expression profile, is monitored simultaneously. Such profiles will 
typically invove a plurality of those entitites described herein.. 
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In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. Alternatively, PGR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PGR reaction can then be performed and 
5 analyzed for each well. 

Modulators of angiogenesis 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more angiogenesis-associated sequences, e.g., a polynucleotide 
10 sequence set out in Tables 1-8 , Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate angiogenesis, modulate angiogenesis proteins, bind to an angiogenesis protein, 
or interfere with the binding of an angiogenesis protein and an antibody or other binding 
partner. 

15 The term "test compoimd" or "drug candidate" or "modulator" or granomatical 

equivalents as used herein describes any molecule, e.g.y protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the angiogenesis phenotype or the expression of an angiogenesis sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

20 expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses an angiogenesis phenotype, for example to a normal 
tissue fingerprint. In another embodiment, a modulator induced an angiogenesis phenotype. 
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 

25 concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

In one aspect, a modulator will neutralize the effect of an angiogenesis protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and thereby has 
substantially no effect on a cell. 
30 In certain embodiments, combinatorial libraries of potential modulators will be 

screened for an ability to bind to an angiogenesis polypeptide or to modulate activity. 
Gonventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
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and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a hbrary containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compoimds thus identified can serve as 
conventional "lead compoxmds" or can themselves be used as potential or actual therapeutics. 
A combinatorial chemical library is a collection of diverse ch^ical 

1 0 compounds generated by either chenwcal synthesis or biological synthesis by combining a 
number of chemical '^building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). MilUons of chemical 

1 5 compoxmds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al (1994) J, Med. Chem. 37(9): 1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Int. J. Pept. Prot. Res., 

20 37: 487-493, Houghton et al. (1991) Nature, 354: 84-88), peptoids (PCT PubHcation No WO 
91/19735, 26 Dec. 1991), encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), 
random bio-oligomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
et al, (1993) Proc. Nat Acad. ScL USA 90: 6909-6913), vmylogous polypeptides (Hagihara 

25 et al. (1992) J. Amer. Chem. Soc. 114: 6568), nonpeptidal peptidomimetics with a Beta-D- 
Glucose scaffolding (Hirschmann et al., (1992) J. Amer. Chem. Soc. 1 14: 9217-9218), 
analogous organic syntheses of small compound libraries (Chen et al. (1994) J. Amer. Chem. 
Soc. 116: 2661), oUgocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl 
phosphonates (Campbell et al., (1994) J. Org. Chem. 59: 658). See, generally, Gordon et al., 

30 (1994) J. Med. Chem. 37:1385, nucleic acid libraries (see, e.g., Strategene, Corp.), peptide 
nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), antibody Ubraries (see, e.g., Vaughn 
etal. (1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/1 0287), carbohydrate 
libraries (see, e.g., Liang et al, (1996) Science, 274: 1520-1522, and U.S. Patent No. 
5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, Baum (1993) 
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C&EN, Jan 18, page 33; isoprenoids, U-S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 
5 Devices for the preparation of combinatorial libraries are commercially 

available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Wobum, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
1 0 solution phase chemistries. These systems include automated workstations like the 

automated synthesis apparatus developed by Takeda Chemical Industries, LTD, (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate n, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
1 5 with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available {see, e.g., ComGenex, Princeton, N. J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
20 Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of angiogenesis gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

25 High throughput assays for the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
for example, U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 

30 acid binding {i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
{see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
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typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibihty and customization. The manufacturers of such systems provide 
5 detailed protocols for various high throughput systems. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fi-agments of naturally occurring proteins. Thus, e.g^., cellular extracts containing 

1 0 proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodunent are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Paticularly useful test compound will be directed to the class of proteins to which 

15 the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of fi-om about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and fi-om about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occiuring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 

20 "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 

25 most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodunent, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected fi-om a 

30 limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, for example, of hydrophobic 
amino acids, hydrophihc residues, sterically biased (either small or large) residues, towards 
the creation of nucleic acid binding domains, the creation of cysteines, for cross-linking. 
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prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation 
sites, etc., or to purines, etc. 

Modulators of angiogenesis can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
5 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

10 After the candidate agent has been added and the cells allowed to incubate for 

some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis bufifers, electroporation, etc., 
with purification and/or amplification such as PGR performed as appropriate. For example, 

15 an in viti'o transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzjmie, 

20 such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 

appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 

25 streptavidin is labeled as described above, thereby, providing a detectable signal for the 

boimd target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

30 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,1 18, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. Hie assays are 
generally run under stringency conditions v^hich allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
5 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
g^erally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
1 0 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in dififei^t orders, 
with preferred embodiments outlined below. Li addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g, albumin, detergents, etc. 
15 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc, may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
20 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the angiogenesis phenotype. 
In one embodiment, screening is perfi3rmed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e,g., for diagnostic applications, having identified difTerentially 
25 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. Jn an another embodimrat, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, havmg identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
30 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress an 
angiogenesis expression pattern leading to a normal expression pattern, or to modulate a 
single angiogenesis gene expression profile so as to mimic the e:?qHession of the gene fi-om 
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normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent Comparing expression profiles between 
normal tissue and agent treated angiogenesis tissue reveals genes that are not expressed in 
normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. These agent- 
specific sequences can be identified and used by methods described herein for angiogenesis 
genes or proteins. In particular these sequences and the proteins they encode find use in 
marking or identifying agent treated cells. In addition, antibodies can be raised against the 
agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 

Thus, in one embodiment, a test compound is administered to a population of 
angiogenic cells, that have an associated angiogenesis expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodimmts, nucleic acid 
encoding a protemaceous candidate agent {i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral constract, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that 
modulate, e.g., induce or suppress the angiogenesis phenotype. A change in at least one 
gene, preferably many, of the expression profile indicates that the agent has an effect on 
angiogenesis activity. By defining such a signature for the angiogenesis phenotype, screens 
for new drags that alter the phenotype can be devised. With this approach, the drug target 
need not be known and need not be represented in the original expression screwing platform, 
nor does the level of transcript for the target protein need to change. 

Measure of angiogenesis polypeptide activity, or of angiogenesis or the 
angiogenic phenotype can be performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the anagiogenesis polypeptides can be measured by 
examining parameters described above. A suitable physiological change that affects activity 
can be used to assess the influence of a test compound on the polypeptides of this invention. 
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When the functional consequences are determined using intact cells or animals, one can also 
measure a variety of effects such as, in the case of angiogenesis associated with tumors, 
tumor growth, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabohsm such as 
5 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian angiogenesis polypeptide is typically used, e.g.^ 
mouse, preferably hvunan. 

A variety of angiogenesis assays are known to those of skill in the art. Various 
models have been employed to evaluate angiogenesis (e.g., Croix et aL, Science 289:1 197- 

10 1202, 2000 and Kahn et al, Amer. J. Pathol. 156:1887-1900). Assessement of angiogenesis 
in the presence of a potential modulator of angiogenesis can be performed using cell-cultre- 
based angiogenesis assays, e.g., endothelial cell tube formation assays, as well as other 
bioassays such as the chick CAM assay, the mouse corneal assay, and assays measuring ttie 
effect of administering potential modulators on implanted tumors. The chick CAM assay is 

15 described by O'Reilly, et al. Cell 79: 315-328, 1994. Briefly, 3 day old chicken embryos with 
intact yolks are separated from the egg and placed in a petri dish. After 3 days of incubation, 
a methylcellulose disc containing the protein to be tested is applied to the CAM of individual 
embryos. After about 48 hours of incubation, the embryos and CAMs are observed to 
detemiine whether endothelial growth has been inhibited. The mouse corneal assay involves 

20 implanting a growth factor-containing pellet, along with another pellet containing the 

suspected endothelial growth inhibitor, in the cornea of a mouse and observing the pattern of 
capillaries that are elaborated in the cornea. Angiogenesis can also be measured by 
determining the extent of neovascularization of a tumor. For example, carcinoma cells can be 
subcutaneously inoculated into athymic nude mice and tumor growth then monitored. The 

25 cancer cells are treated with an angiogenesis inhibitor, such as an antibody, or other 

compound that is exogenously administered, or can be transfected prior to inoculation with a 
polynucleotide inhibitor of angiogenesis. Immunoassays using endothelial cell-specific 
antibodies are typically used to stain for vascularization of tumor and the number of vessels 
in the tumor. 

30 Assays to identify compounds with modulating activity can be performed in 

vitro. For example, an angiogenesis polypeptide is j&rst contacted with a potential modulator 
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the angiogenesis polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
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ELIS A and the like with an antibody that selectively binds to the angiogenesis polypeptide or 
a fragment thereof. For measurement of mKNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., norttiOTi hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 

5 detection agents, e.g., fluorescently or radio actively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the angiogenesis 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 

1 0 treatment with a potmtial modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 

1 5 expression of the gene or the gene product itself can be done. The gene products of 

differentially expressed genes are sometimes referred to herein as **angiogenesis proteins". In 
preferred embodiments the angiogenesis protein comprises a sequence shown in Table 8. 
The angiogenesis protein may be a fragment, or altematively, be the frill length protein to a 
fragment shown herein. 

20 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 

amino acids long. More preferably the fragment is a soluble fragment. In one embodiment 
an angiogenesis protein is conjugated or frised to an immimog^c agent or BSA. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 

25 embodiment, screens are designed to first find compoimds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compoimds are identified, 
variants can be fiirther screened to better evaluate strucutre activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 

30 isolated gene product is used; that is, the gene products of one or more differentially 

expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immtmoassays are run to determine the amount of protein present. 
Altematively, cells comprising the angiogenesis proteins can be used in the assays. 
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Thus, in a preferred embodiment, the methods comprise combining an 
angiogenesis protein and a candidate compound, and determining the binding of the 
compound to the angiogenesis protein. Preferred embodiments utiUze the human 
angiogenesis protein, although other mammalian proteins may also be used, for example for 
5 the development of animal models of human disease. In some embodiments, as outUned 
herein, variant or derivative angiogenesis proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the angiogenesis 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 

10 made of any composition to which the compositions can be bound, is readily separated Scorn 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 

15 teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies 

20 (which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess imboimd material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 

25 other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, 
and a test compound is added to the assay. Altematively, tiie candidate agent is bound to the 
support and the angiogenesis protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

30 analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 
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The deteimination of the binding of the test modulating compound to the 
angiogenesis protein may be done in a number of ways. In a preferred embodiment, the 
compound is labelled, and binding determined directly, e.g., by attaching all or a portion of 
the angiogenesis protein to a solid support, adding a labelled candidate agent (e.g., a 
5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

By "labeled" herein is meant that the compoimd is either directly or indirectly 
labeled with a label which provides a detectable signal, e.g. radioisotope, fluorescers, 
enzyme, antibodies, particles such as magnetic particles, chemiluminescers, or specific 

10 binding molecules, etc. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or 
indirectly provide a detectable signal. 

15 In some embodiments, only one of the components is labeled, e.g., the 

proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component caa be labeled with different labels, e.g., ^^^I for the proteinsand a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
usefiil. 

20 hi one embodiment, the binding of the test compound is determined by 

competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule {i.e. an angiogenesis protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound, hi one 

25 embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Licubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40^C. hicubation periods are typically optimized, e.g., to facilitate rapid high tiiroughput 
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 

30 removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is bindinjg 
to the angiogenesis protein and thus is capable of binding to, and potentially modulating, the 
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activity of the angiogenesis protein. In this embodiment, either component can be labeled. 
Thus, for example, if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compoimd is labeled, the 
presence of the label on the support indicates displacement. 
5 In an alternative embodiment, the test compoimd is added first, with 

incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the angiogenesis protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound is 

1 0 capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise difTerential screening to 
identity agents that are capable of modulating the activitity of the angiogenesis proteins. In 
this embodiment, the methods comprise combining an angiogenesis protein and a competitor 
in a first sample. A second sample comprises a test compound, an angiogenesis protein, and 

15 a competitor. The binding of the competitor is detemiined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the angiogenesis protein and potentially modulating its activity. That is, if tiie 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the angiogenesis protein. 

20 Alternatively, differential screening is used to identify drug candidates that 

bind to the native angiogenesis protein, but cannot bind to modified angiogenesis proteins. 
The structure of the angiogenesis protein may be modeled, and used in rational drag design to 
synthesize agents that interact with that site. Drag candidates that affect the activity of an 
angiogenesis protein are also identified by screening drags for the ability to either enhance or 

25 reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for tiie binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 

30 and the amoxmt of bound, generally labeled agent determined. For example, where a 

radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of boimd compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
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to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The nuxture 
of components may be added in an order that provides for the requisite binding. 
5 In a preferred embodiment, the invention provides methods for screening for a 

compoimd capable of modulating the activity of an angiogenesis protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising angiogenesis 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes an angiogenesis protein. In a preferred CTibodiment, a library of 

1 0 candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or oflier cells (i.e. cell-cell contacts). In another 

15 example, the determinations are determined at different stages of the cell cycle process. 

In this way, compoimds that modulate angiogenesis agents are identified. 
Compoimds with pharmacological activity are able to enhance or interfere with the activity of 
the angiogenesis protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 

20 In one embodiment, a method of inhibiting angiogenic cell division is 

provided. The method comprises administration of an angiogenesis inhibitor. In another 
embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
administration of an angiogenesis inhibitor. In a further embodiment, methods of treating 
cells or individuals with angiogenesis are provided. The method comprises administration of 

25 an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed 
above. In another embodiment, the angiogenesis inhibitor is an antisense molecule. 



Polynucleotide modulators of angiogenesis 
3 0 Antisense Polynucleotides 

In certain embodiments, the activity of an angiogenesis-associated protein is 
downregulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., an angiogenesis protein mRNA, or a subsequence th^eof. 
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Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
ofthemRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
5 subxmits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the angiogenesis protein 
mRNA, See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

1 0 Such antisense polynucleotides can readily be synthesized using recombinant 

means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense 

15 oligonucleotides. Sense oligonucleotides can, e.g., be employed to block trancription by 

binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for angiogenesis molecules. A preferred antisense 
molecule is for an angiogenesis sequences in Tables 1-8 , or for a ligand or activator th^eof. 

20 Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, for example. Stein and Cohen (Cancer Res. 48:2659, 
1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

25 

Ribozyjnes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of angiogenesis-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalj^ically cleaves other RNA molecules. Different kinds of ribozymes have 
30 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 

RNase P, and axhead ribozymes (see, e.g., Castanotto et al. (1994) Adv. in Pharmacology 25: 
289-317 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g.^ in Hampel et al. 
(1990) Nucl. Acids Res. 18: 299-304; Hampel et al. (1990) European Patent Publication No. 0 
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360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill in 
the art {see, e.g., Wong-Staal et al, WO 94/26877; Ojwang et al (1993) Proc. Natl Acad. 
Sci. USA 90: 6340-6344; Yamada et al (1994) Human Gene Tfterapy 1: 39-45; Leavitt et al 
(1995) Proa Natl Acad, ScL USA 92: 699-703; Leavitt et al (1994) Human Gene Tfierapy 5: 

5 1151-120; and Yamada et al (1994) Virology 205: 121-126). 

Polynucleotide modulators of angiogenesis may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

10 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecide does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
angiogenesis may be introduced into a cell containing the target nucleic acid sequence, e.g.j 

1 5 by formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 

20 anti-angiogenesis antibody that reduces or eliminates the biological activity of an 

endogeneous angiogenesis protein. Alternatively, the methods comprise, administering to a 
cell or organism a recombinant nucleic acid encoding an angiogenesis protein. This may be 
accomplished in any number of ways. In a preferred embodiment, for example when the 
angiogenesis sequence is down-regulated in angiogenesis, such state may be reversed by 

25 increasing the amount of angiogenesis gene product in the cell. This can be accomplished, 
e.^., by overexpressing the endogeneous angiogenesis gene or administering a gene encoding 
the angiogenesis sequence, using known gene-therapy techniques, for example. In a 
preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described 

30 in PCT/US93/03868, hereby incorporated by reference in its entireity. Altematively, for 

example when the angiogenesis sequence is up-regulated in angiogenesis, the activity of the 
endogeneous angiogenesis gene is decreased, for example by the administration of a 
angiogenesis antisense nucleic acid or other inhibitor, such as RNAi. 
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In one embodiment, the angiogenesis eproteins of the present invention may 
be used to generate polyclonal and monoclonal antibodies to angiogenesis proteins. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to pimfy angiogenesis 
5 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The angiogenesis 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
angiogenesis protems. The antibodies may also be used as blocking polypeptides, as outlined 
10 above, since they will specifically bind to the angiogenesis protein. 

Methods of identifying variant angiogenesis-associated sequetices 

Without being boimd by theory, expression of various angiogenesis sequences 
is correlated with angiogenesis. Accordingly, disorders based on mutant or variant 

15 angiogenesis genes may be determined. In one embodiment, the invention pwrovides methods 
for identifying cells containing variant angiogenesis genes, e.g., determining all or part of the 
sequence of at least one endogeneous angiogenesis genes in a cell. This may be 
accomplished using any number of sequencing techniques. In a preferred embodiment, the 
invention provides methods of identifying the angiogenesis genotype of an individual, e.g., 

20 determining all or part of the sequence of at least one angiogenesis gene of the individual. 

This is generally done in at least one tissue of the individual, and may include tiiie evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced angiogenesis gene to a known angiogenesis gene, 
a wild-type gene. 

25 The sequence of all or part of the angiogenesis gene can then be comp^ed to 

the sequence of a known angiogenesis gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a a difference in the sequence between the angiogenesis gene of 
the patient and the known angiogenesis gene correlates with a disease state or a propensity 

30 for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to 
determine the number of copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis gei^ are used as probes to 
determine the chromosomal localization of tiie angiogenesis genes. Information such as 
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chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the 
angiogenesis gene locus. 

5 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an angiogenesis protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 

10 known techniques (e.g., Ansel et aL, Pharmaceuitcal Dosage Forms and Dmg Delivery, 
Lippincott, Williams & Wilkins Publishers, ISBN:0683305727; Lieberman (1992) 
Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 082476918X, 
0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of Pharmaceutical 
Compounding, Amer. Pharmacutical Assn, ISBN 0917330889; and Pickar (1999) Dosage 

15 Calculations, Debnar Pub, ISBN 0766805042). As is known in the art, adjustments for 
angiogenesis degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
dmg interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

20 A "patient" for the purposes of the present invention includes both humans and 

other animals, particularly mammals. Thus the methods are applicable to both hiunan 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the 

25 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, for example, in the treatment of wounds and inflammation, the angiogenesis 
proteins and modulators may be directly applied as a solution or spray. 

30 The pharmaceutical compositions of the present invention comprise an 

angiogenesis protein in a form suitable for administration to a patient. In the preferred 
embodiment, the pharmaceutical compositions are in a water soluble form, such as being 
present as pharmaceutically acceptable salts, which is meant to include both acid and base 
addition salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain 
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the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
5 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodixmi, potassium, hthium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

10 potassium, sodiimi, calcium, and magnesium salts. Salts derived from phamiaceutically 

acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylanMue, 
tripropylamine, and ethanolamine. 

1 5 The pharmaceutical compositions may also include one or more of the 

following: carrier proteins such ais serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, com and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of tmit 

20 dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that angiogenesis protein modulators (e.g., antibodies, 
antisense constmcts, libozymes, smaU organic molecules, etc) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 

25 molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(5) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise an angiogenesis 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 

30 carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
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and the like, for example, sodium acetate, sodiimi chloride, potassium chloride, calciimi 
chloride, sodiimi lactate and the like. The concentration of active agent in these formulations 
can vary widely, and will be selected primarily based on fluid volumes, viscosities, body 
weight and the like in accordance with the particular mode of administration selected and the 
patient's needs (e.g^., Remington 's Pharmaceutical Science^ 15th ed.. Mack Publishing 
Company, Easton, Pennsylvania (1980) and Goodman and Gillman, Tlte Phamiacologial 
Basis ofTIterapeutics.QiardmBn, J.G, Limbird, L.E, Molinofif, P.B., Ruddon, R.W, and 
Oilman, A.G.,eds) TheMcGraw-Hill Companies, Inc., 1996). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into die blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g.. Remington 's Pharmaceutical Science and Goodman and Gillman, Hie 
Pharmacologial Basis of Therapeutics^ supra. 

The compositions containing modulators of angiogenesis proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease {e.g., a cancer) in an 
amount sufficient to cvire or at least partially arrest the disease and its complications. An 
amount adequate to accomplish this is defined as a ^therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 
effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 
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It will be appreciated that the present angiogenesis protein-modulating 
compounds can be administered alone or in combination with additional angiogenesis 
modulating compounds or with other therapeutic agent, e.g^., other anti-cancer agents or 
treatments. 

5 In nimierous embodiments, one or more nucleic acids, e.g^., polynucleotides 

comprising nucleic acid sequences set forth in Tables 1-8 , such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of angiogenesis-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 

1 0 organism-based) recombinant expression systems. 

The particular procedxffe used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 

15 plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g-., Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger), F.M. Ausubel et aL, eds.. Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 

20 Sons, Inc., (supplem^ted through 1999), and Sambrook et aL^ Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989. 

In a preferred embodiment, angiogenesis proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 

25 angiogenesis genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the angiogenesis coding regions) can be administered in a gene therapy 
application. These angiogenesis genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

30 Angiogenesis polypeptides and polynucleotides can also be administered as 

vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, for example, lipidated peptides (e.^.,Vitiello, A. et aL^ J, Clin. 
Invest, 95:341, 1995), peptide compositions encapsulated in polyCDL-lactide-co-glycolide) 
CTLG") microspheres {see^ e.g., Eldridge, etal.^Molec. Immunol. 28:287-294, 1991: Alonso 
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etal. Vaccine 12:299-306, 1994; Jones et al. Vaccine 13:675-681, 1995), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (5ee, e.^., Takahashi et 
al, Nature 344:873-875, 1990; Hu et a/., Clin Exp Immunol 1 13:235-243, 1998), multiple 
antigen peptide systems (MAPs) {see e.g.^ Tam, J. P., Proc. Natl Acad. Set U.SA. 85:5409- 

5 5413, 1988; Tam, J.P., J. Immunol Methods 196:17-32, 1996), peptides fomiulated as 
multivalent peptides; peptides for use in ballistic delivery systems, typically crystallized 
peptides, viral delivery vectors (Perkus, M. E. et al^ In: Concepts in vaccine development, 
Kaufinann, S. H. E,, ed., p. 379, 1996; Chakrabarti, S. et al. Nature 320:535, 1986; Hu, S. L. 
et al. Nature 320:537, 1986; Kieny, M.-P. et al, AIDS Bio/Technology 4:790, 1986; Top, F. 

10 H. et al, J. Infect. Dis. 124:148, 1971; Chanda, P. K. et a/.. Virology 175:535, 1990), 

particles of viral or synthetic origin (eg., Kofler, N. et al, J. Immunol Methods. 192:25, 
1996; Eldridge, J. K. et al, Sem. Hematol 30:16, 1993;Falo,L. D., Jr. etal. Nature Med. 
7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol 
4:369, 1986; Gupta, R. K. et al. Vaccine 1 1:293, 1993), liposomes (Reddy, R, et al, J. 

15 Immunol 148:1585, 1992; Rock, K. L., Immunol Today 17:131, 1996), or, naked or particle 
absorbed cDNA (Ulmer, J. B. et al. Science 259:1745, 1993; Robinson, H. L., Hunt, L. A., 
and Webster, R. G., Vaccine 11:957, 1993; Shiver, J. W, etal. In: Concepts in vaccine 
development, Kaufinann, S. H. E., ed., p, 423, 1996; Cease, K. B., and Berzofsky, J. A., 
Anm4. Rev. Immunol 12:923, 1994 and Eldridge, J. H. et al, Sem. Hematol 30:16, 1993), 

20 Toxin-targeted delivery technologies, also known as receptor mediated targeting, such as 
those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen fi-om rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, for example, Freund*s Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, 
Detroit, MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 
(SmithKline Beecham, Philadelphia, PA); aluminum salts such as almninum hydroxide gel 
(alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 

30 acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 



74 



3NSDCXiD: «WO__0a070492A2_l.> 



wo 02/079492 



PCT/US02/04915 



Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff e/. a/.. Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,^79,647; 
5 WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 

1 0 invention can be expressed by viral or bacterial vectors. Examples of e7q>ression vectors 

include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, for example, as a vector to express nucleotide sequences that encode 
angiogenic polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia vims expresses the immunogenic peptide, and thereby elicits an 

1 5 immune response. Vaccinia vectors and methods usefril in immunization protocols are 
described m, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Cahnette 
Guerin). BCG vectors are described in Stover et al. Nature 351 :456-460 (1991). A wide 
variety of other vectors useful for therapeutic administration or immunization e.g. adeno and 
adeno-associated virus vectors, retroviral vectors. Salmonella typhi vectors, detoxified 

20 anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 

description herein (see, e.g., Shata et al (2000) Mol Med Today, 6: 66-71; Shedlock et al., J 
Leukoc Biol 68,:793-806, 2000; Hipp et al.. In Vivo 14:571-85, 2000). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 

25 regulatable promoter or a tissue-specific promoter for expression in an angiogenesis patimt. 
The angiogenesis gene used for DNA vaccines can encode fiill-Iength angiogenesis proteins, 
but more preferably encodes portions of the angiogoiesis proteins including peptides derived 
from the angiogenesis protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences dwived from an angiogenesis gene. 

30 For example, angiogenesis-associated genes or sequence encoding subfragments of an 
angiogenesis protein are introduced into expression vectors and tested for their 
immimogenicity in the context of Class I MHC and an abiUty to generate cytotoxic T cell 
responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the angiogenesis polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment angiogenesis genes find use in generating 
animal models of angiogenesis. When the angiogenesis gene identified is repressed or 
diminished in angiogenesic tissue, gene therapy technology, e.g., wherein antisense RNA 
directed to the angiogenesis gene will also diminish or repress expression of the gene. 
Animal models of angiogenesis find use in screening for modulators of an angiogenesis- 
associated sequence or modulators of angiogenesis. Similarly, transgenic animal technology 
including gene knockout technology, for example as a result of homologous recombination 
with an appropriate gene targeting vector, will result in the absence or increased expression 
of the angiogenesis protein. When desired, tissue-specific expression or knockout of the 
angiogenesis protein may be necessary. 

It is also possible fliat the angiogenesis protein is overexpressed in 
angiogenesis. As such, transgenic animals can be generated that overexpress the 
angiogenesis protein. Depending on the desired expression level, promoters of various 
strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression 
level of the transgene. Animals generated by such methods find use as animal models of 
angiogenesis and are additionally useful in screening for modulators to treat angiogenesis or 
to evaluate a tiierapeutic entity. 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, angiogenesis-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative angiogenesis polypeptides or polynucleotides, small molecules 
inhibitors of angiogenesis-associated sequences etc. A th^apeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instmctional materials containing directions 
{i,e,, protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
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medium capable of storing such instructions and commtmicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
5 materials. 

The present invention also provides for kits for screening for modulators of 
angiogenesis-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: an angiogenesis-associated polypeptide or polynucleotide, reaction tubes, and 

1 0 instructions for testing angiogenic-associated activity. Optionally, the kit contains 

biologically active angiogenesis protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 

1 5 parameters in disease which may be identified in historical or outcome data. 

It is understood that the examples described above in no way serve to limit 
the true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession nmnbers, and patent applications cited in this 
20 specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

25 Example 1 : Tissue Preparation, Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Homogenize tissue samples in 1ml of TRIzol per SOmg of tissue using a 
PolytronSlOOhomogenizer. The generator/probe used depends upon the tissue size. A 
generator that is too large for the amount of tissue to be homogenized will cause a loss of 

30 sample and lower RNA yield. TRIzol is added directly to frozen tissue, which is then 

homogenize. Following homogenization, insoluble material is removed by centrifiigation at 
7500 X g for 15 min in a Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf 
centrifuge at 4^C. The clear homogenate is transferred to a new tube for use. The samples 
may be fix)zen now at -60*" to -70°C (and kept for at least one month). The homogenate is 
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mixed with 0.2ml of chloroforai per 1ml of TRIzol reagent used in the original 
homogenization and incubated at room temp, for 2-3 minutes. The aqueous phase is then 
separated by centrifiigation and transferred to a fresh tube and the RNA precipitated using 
isopropyl alcohol. The pellet is isolated by centrifugation, washed, air-dried, resuspended in 
5 an appropriate volume of DEPC H2O, and the absorbance measured. 

Purification of poly A+ mRNA fi"om total RNA is performed as follows. Heat 
an oligotex suspension to 37*^0 and mixing immediately before adding to RNA. The 
Elution Buffer is heated at 70°C. Warm up 2 x Binding Buffer at 65°C if there is precipitate 
m the buffer. Mix total RNA with DEPC-treated water, 2 x Binding Buffer, and Oligotex 
10 according to Table 2 on page 16 of the Oligotex Handbook. Incubate for 3 minutes at 65°C. 
Incubate for 10 minutes at room temperature. Centrifuge for 2 minutes at 14,000 to 18,000 g. 
Remove supernatant without disturbing Oligotex pellet. A little bit of solution can be left 
behind to reduce the loss of Oligotex. Gently resuspend in Wash Bxiffer OW2 and pipet onto 
spin column. Centrifuge the spin colunm at full speed for 1 minute. Transfer spin column to 
15 a new collection tube and gently resuspend in Wash Buffer OW2 and centrifuge as describe 
herein. Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70oC) 
Elution Buffer. Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as 
above. Repeat elution with fresh elution buffer or use first eluate to keep the elution volume 
low. Read absorbance, using diluted Elution Buffer as the blank. Before proceeding with 
20 cDNA synthesis, precipitate the mRNA as follows: add 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. 
of cold 100% ethanol. Precipitate at -20oC 1 hour to overnight (or 20-30 min. at -70oC). 
Centrifuge at 14,000-16,000 x g for 30 minutes at 4oC. Wash pellet with 0.5ml of 
80%ethanol (-20oC) then centrifuge at 14,000-16,000 x g for 5 minutes at room temperature.. 
Repeat 80% ethanol wash. Air dry the ethanol from the pellet in the hood.. Suspend pellet in 
25 DEPC H2O at lug/ul concentration. 

To further Clean up total RNA using Qiagen's RNeasy kit, add no more tiian 
lOOug to an RNeasy column. Adjust sample to a volume of lOOul with RNase-free water. 
Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do not 
centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 15 sec at 
30 >lO,0O0rpm. Transfer column to a new 2-n[il collection tube. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough then centrifuge for 2 min at 
maximum speed to dry column membrane. Transfer column to a new 1.5-ml collection tube 
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and apply 30-50ul of RNase-free water directly onto column membrane. Centrifuge 1 min at 
>10,000rpm. Repeat elution. and read absorbance. 

cDNA synthesis using Gibco's "Superscript Choice System for cDNA Synthesis" kit 
5 First Strand cDNA synthesis is performed as follows. Use 5ug of total RNA 

or lug of polyA+ mRNA as starting material. For total RNA, use 2ul of Superscript RT. For 
polyA+ mRNA, use lul of Superscript RT. Final voliune of first strand synthesis mix is 
20ul. RNA must be in a volume no greater than lOul. Incubate RNA with lul of lOOpmol 
T7-T24oligofor lOminatTOC. On ice, add 7 ul of: 4ul 5X 1st Strand Buffer, 2ul of O.IM 

10 DTT, and 1 ul of IQmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT. 
Incubate at 37C for 1 hour. 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 ul 
DEPC H2O; 30ul 5X 2nd Strand Buffer; 3ul lOmM dNTP mix; lul lOU/ul E.coli DNA 
Ligase; 4ul lOU/ul E.coli DNA Polymerase; and lul 2U/ul RNase H, Mix and incubate 2 

15 hours at 16C. Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C. Add lOul of 0.5M 

EDTA. A further clean-up of DNA is performed using phenol:chloroform:isoamyl Alcohol 
(25:24:1) purification. 

In vitro Transcription (TVT) and labeling with biotin is performed as follows: 
Pipet 1 .5ul of cDNA into a thin-wall PGR tube. Make NTP labeling mix by combining 2ul T7 

20 lOxATP (75mM) (Ambion); 2ul T7 lOxGTP (75mM) (Ambion); 1.5ul T7 lOxCTP (75mM) 
(Ambion); l.Sul T7 lOxUTP (75mM) (Ambion); 3.75ul lOmM Bio-ll-UTP (Boehringer- 
Mannheim/Roche or Enzo); 3.75ul lOmM Bio-16-CTP (Enzo); 2ul lOx T7 transcription 
buffer (Ambion); and 2ul lOx T7 enayme mix (Ambion). The final volume is 20ul. Incubate 
6 hours at 37°C in a PCR machine. The KNA can be furthered cleaned. 

25 Fragmentation is performed as follows. 1 5 ug of labeled RNA is usually 

fragmented. Try to minimize the fragmentation reaction volimie; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
KNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 

30 buffer is 200 mM Tris-acetate, pH 8. 1 ; 500 mM KOAc; 1 50 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65''C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 
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For hybridization, 200 ul (lOug cKNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50ng/ul final cone); 50 pM 948-b control 
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; O.lmg/ml herring sperm DNA; 
0,5mg/ml acetylated BSA; and 300 ul with IxMES hyb buffer. 

Labeling is performed as follows: The hybridization reaction includes non- 
biotinylated IVT (purified by RNeasy colimms); IVT antisense RNA 4 M'g-M^U random 
HexamCTs (1 \xg/\il) 4 p.1 and water to 14 ul. The reaciton is incubated at 70^C, 10 min. 
Reverse transcriptionis performed in the following reaction: 5X First Strand (BRL) buffer, 6 
ul; 0;i M DTT, 3 ^il; SOX dNTP mix, 0.6 ^1; H2O, 2.4 ill; Cy3 or Cy5 dUTP (ImM), 3 ^1; SS 
RT n (BRL), 1 jal in a final volume of 16 |iL Add to hybridization reaction. Incubate 30 
min., 42''C. Add 1 |al SSII and incubate another hour. Put on ice. SOX dNTP mix (25mM of 
cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 jal each of lOOmM dATP, dCTP, and 
dGTP; 10 |Lil of lOOmM dTTP to 15 ^l H20. dNTPs from Pharmacia) 

RNA degradation is performed as follows. Add 86 ^1 H20, 1.5 jil IM NaOH/ 
2mM EDTA and incubate at 6S^C, 10 min.. For U-Con 30, 500 ul TE/sample spin at 7000g 
for 1 0 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in SOOjil buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dil of DNAse/30ul Rx and incubate at 37''C for 15 min. Incubate 
at 5 min 9S^C to denature the DNAse/ 

For sample preparation, add Cot-1 DNA, 10 ^1; SOX dNTPs, 1 lal; 20X SSC, 
2.3 IliI; Na pyro phosphate, 7.5 ixl; lOmg/ml Herring sperm DNA; lul of 1/10 dilution to 21.8 
final vol. Dry in speed vac. Resuspendin 15 ^1H20. Add 0.38 ^1 10% SDS. Heat95^C,2 
min and slow cool at room temp, for 20 min. Put on slide and hybridize overnight at 64®C. 
Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 mis 20X SSC-K),75mls 
10% SDS in 250mls H20; IX SSC: 5 min., 12,5 mis 20X SSC in 250mls H20; 0.2X SSC: 5 
min., 2.5 mis 20X SSC in 250mls H20. Dry sUdes and scan at appropiate PMT's and 
chaimels. 

Example 2. A model of angiogenesis is used to determine expression in angiogenesis 

In the model of angiogenesis used to determine expression of angiogenesis- 
associated sequences, hvunan umbilical vein endothelial cells (HTJVEC) were obtained, e.g., 
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as passage 1 (pi) frozen cells from Cascade Biologies (Oregon) and grown in maintenance 
mediimi: Mediirai 199 (Life Technologies) supplemented with 20% pooled human serum, 
100 mg/ml heparin and 75 mg/ml endothelial cell growth supplements (Sigma) and 
gentamicin (Life Technologies). An in vitro cell system model was used in which 2x10^ 
HUVECs were cultured in 0.5 ml 3 mgs/ml plasminogen-depleted fibrinogen (Calbiochem, 
San Diego, CA) that was polymerized by the addition of 1 unit of maintenance medium 
supplemented with 100 ng/ml VEGF and HGF and 10 ng/ml TGF-a (R&D Systems, 
Minne^olis,MN) added (growtti medium). The growtti medium was replaced every 2 days. 
Samples for RNA were collected, at 0, 2, 6, 15, 24, 48, and 96 hours of culture. The 
fibrin clots were placed in Trizol (Life Technologies) and disrupted using a Tissuemizer. 
Thereafter standard procedures were used for extracting the RNA (e.^.. Example 1). 

Angiogenesis associated sequences thus identified are shown in Tables 1-8 . 
As indicated, some of the Accession numbers include expression sequence tags (ESTs). 
Thus, in one embodiment herein, genes within an expression profile, also termed expression 
profile genes, include ESTs and are not necessarily full length. 
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TABLE 1: 
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Ptey. Unique Eos probeset identifier number 

Aooesston: Accession number used for previous patent filings 

ExAocn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unlgene number 

UnigeneTiUe: Un^ene gene title 



Pkey Accession ExAocn UnigeneiD UnlgeneTitle 



134404 AB000450 
121443 AB002380 
100082 AB003103 
132817 AB004884 
130150 AF000573.ma1 
100104 AF008937 
130839 AF009301 
427064 AF009368 
100113 D00591 
133980 D00760 
100129 D11139 
100154 D14657 
100169 D14878 
101956 D17716 
100190 D210SO 
134742 D26135 
100211 D26528 
100238 D30742 
130283 D31762 
134237 D31765 
100248 D31888 
100256 D38128 
100262 D38500 
134329 D38551 
100281 D42087 
100294 D49396 
100327 D55640 
100335 D63391 
134495 D63477 
100338 D63483 
135152 D64015 
134269 D79990 
100372 D79997 
134304 D80010 
100394 084278 
100405 D86425 
100418 D86978 
133154 D87012 
134347 D87075 
128653 D87432 
100438 D87448 
134593 D87845 

100481 HG10984<T1098 X70377 
100552 HG2167-HT2237 AA019521 
100591 HG24154fr2511 NIVL004091 
cds 

100652 HG2825-HT2949 BE613608 
100662 HG2887-HT3031_r 
100899 HG46604fr5073 AL039123 
100905 HG4704-HT5146 L12260 
100945 HG8M-HT884 AF002225 
100950 HG919-HT919 
100964 J00212_f 
135407 J04029 
130149 J04031 
131877 J04088 
101016 J04543 
134786 L06139 
134100 L07540 
134078 L08895 
101132 L11239 
134849 L11353 
106432 L13773 



AB000450 Hs.82771 vaccinia related Idnase 2 

AF180681 Hs.6582 Rho guanine exdrange factor (GEF) 12 

AA1 30080 Hs.4295 proteasome (prosome. macropain) 26S subunit non-ATPase. 12 

N27852 Hs.57553 tousled<li)ce Idnase 2 

BE094848 Hs.151 13 )H3nfK)gentisate U-dioxygenase (homogentisate oxidase) 

AF008937 Hs.102178 syntaxin 16 

ABOl 1 169 Hs.20141 similar to S. cerewslae SSM4 

AF029674 Hs.173422 KIAA1605 protein 

NM_001269 Hs.84746 chromosome condensation 1 

AA294921 Hs.25081 1 v-ral simian leukemia viral oncogene homolog B (ras related; GTP binding protein) 

AA469369 Hs.5831 tissue inhibitor of metalloprotetnase 1 (eiythroid potentiating adnrity, coBagenase inhtbitor) 

H60720 Hs.81892 KIAA0101 gene product 

AL037228 Hs.82043 D123 gene product 

NM_002410 Hs.121502 mannosyl (alpha-1.6-)^lycoproteln beta-1,6-N-acetyl^lucosamlnyltransleiase 

M91401 Hs.178658 RAD23 (S. cerevisiae) homolog B 

Nl^001346 Hs.89462 diacylglycerol kinase, gamma (90kD) 

D26528 Hs.123058 DEAD/H (Asp^tu-Ala-Asp/Hls) box polypeptide 7 (RNA helicase. 52kD) 

L24959 Hs.34d caldum/calmodulin^ependent protein kinase iV 

NM-012288 Hs.153954 TRAM4ke protein 

D31765 Hs.170114 KIAA0061 protein 

NM.015156 Hs.78398 KIAA0071 protein 

025418 Hs.393 prostaglandin 12 (prostacyclin) receptor (IP) 

D38500 Hs.278468 postmeiotic segregaUon increased 2-^ 4 

N92036 Hs.81848 RA021 (S. pombe) homolog 

AF091035 Hs.184627 KIAA01 18 protein 

AA331881 Hs.75454 peroxiredoxin 3 

D55640 gbtiHuman monocyte PABL (pseudoautosomal boundary-like sequence) mRNA. done Mo2. 

AW247529 Hs.6793 platelet-activating factor acetylhydrolase, isofonn lb. gamma subunit (29kD) 

063477 Hs.84087 KIAA0143 protein 

D86864 Hs.57735 acetyl LDL receptor, SREC 

M96954 Hs.182741 TIA1 cytotoxfc granule-assodated RNA-binding protein-like 1 

NM_014737 Hs.80905 Ras associatton (Ra!GDS/AF-6) domain family 2 

NIVL014791 Hs.184339 KIAA0175 gene product 

BE613486 Hs.81412 lipin 1 

D84284 Hs.66052 CD38 antigen (p45) 

AW291587 Hs.82733 nidogen2 

086978 Hs.84790 KIAA0225 protein 

D87012 Hs.194685 topoisomerase (Dr4A) III beta 

AP164142 Hs.82042 solute carrier fomily 23 (nudeobase transporters), member 1 

D87432 Hs.10315 solute earner family 7 (cationic amino add transporter, y-*- system), member 6 

AA013051 Hs.91417 topoisomerase (DNA) II binding protein 
NM_000437 Hs.234392 platelet-activating factor acetylhydrolase 2 (40kD) 
Hs.121489 cystatinD 
Hs.301946 lysosomal 

Hs.231444 Homo safxens. Similar to hypothetical proteh PR01722, done MGC:15692. mRNA, complete 
Hs.142653 ret finger protein 

AI368680 Hs.816 SRY (sex detennining region Y)-box 2 
Hs.103042 microtubul&assodated protein IB 
Hs.172816 neuregufin 1 

Hs.180686 ubiquitin protein Ggase E3A (human papOloma vims E6-assodated protein. Angelman syndrome) 
AF128542 Hs.166846 polymerase (DNA directed), epsllon 
J00212 Empirically selected from AFFX single probeset 

J04029 Hs.99936 keratin 10 (epidenmiytte hyperkeratosis; keratosis palmaris et plantaris) 
AW067805 Hs.172665 melhytenetetrahydrofolate dehydrogenase (NADP+ dependent). methenyltetrahydrofbbtB 
J04088 Hs.156346 topoisomerase (DNA) 11 alpha (170kO) 
J04543 HSJ8637 annexinA7 

T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous malfbmnations. multiple cutaneous and mucosal) 
AA460085 Hs.171075 repUcatton factor C(adivator 1)5 (36^kD) 

L08895 HS78995 MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer fector 2C) 
L1 1239 Hs^6993 gastnilation bredn homeo box 1 
BE409S25 Hs.902 neurofibromin 2 (t^teral acoustic neuroma) 
AK000310 H5.17138 hypottietka) protein RJ20303 
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3NSOOCIO: <WO 



lUl lO^ 


LloouU 


Alao4o25 


nS.9oo4 


sprndle pole body protein 






L14922 


Hs.166553 


replication factor C (activator 1) 1 (14SkD) 


ion coy 




dcZ97qoO 


HS.0UD9 


neat shock 70kD protein 9B (morta!in-2) 


lUlJDO 


LI 5388 


NM.005308 


Hs.211569 


G protein-coupled receptor kinase 5 


>IOi ICC 


LI 6895 


no7o79 


Hs.1 02267 


iysyl oxidase 




L27470 


Aru83o92 


HS.7560S 


tight junction protein 2 (zona ocdudens 2) 


133975 


L27624 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


134739 


L32976 


NWL002419 


Hs.89449 


mitogen-activated protein kinase kinase kinase 11 


130155 


L33404 


AA101043 


Hs.151254 


kallikrein 7 (chymotryptte, stratum comeum) 


440538 


L35263 


W76332 


Hs79107 


mitogen-activated protein kinase 14 


132813 


L37347 


BE313625 


Hs.57435 


solute canrier family 1 1 (proton-coupled divalent metal ion transporters), member 2 


101294 


L40371 


AF1 68418 


Hs.1 16764 


thyroid honmone receptor intBractDr4 


101300 


L40391 


BE535S11 


Hs.74137 


transmembrane trafricklng protein 


101310 


L41607 


L41607 


Hs.934 


glucosaminyl (N^cetyt) transferase 2, l-branching enzyme 


130344 L77566 
embryonic lethal 


AW250122 


Hs.154879 


DIGeorge syndrome critkal region gene OGSI: likely ortholog of mouse expressed sequence 2 


101381 M13928 


AVV675039 


Hs.1227 


aminolevulinate, delta-, dehydratase 


101668 


M14016 


AW005903 


Hs.78601 


uroporphyrinogen decarboxylase 


133780 


M14219 


AA557660 


Hs.76152 


decorin 


101396 


M15796 


BE267931 


Hs.78996 


proliferating ceil nuclear antigen 


101447 


M21305 


M21305 




gbiHuman alpha satellite and satellite 3 junction DMA sequence. 


101458 


M22092 


M22092 




gb:Human neural cell adhesion molecule (N-CAM) gene, exon SEC and partial ods. 


101470 


M22898 


NM_000546 


Hs.1 846 


tumor protein p53 (Li-Fraumenl syndrome) 


134604 


M22995 


NM_002884 


Hs.865 


RAP1 A. member of RAS oncogene family 


101478 


M23379 


NM_002890 


Hs.758 


RAS p21 protein activator (GTPase activating protein) 1 


406698 


M24364 


X03068 


Hs.73931 


major histocompatibility complex, class It. DQ beta 1 


133519 


M24400 


AW583062 


Hs.74502 


chymotrypsinogen B1 


131185 


M25753 


BE280074 


Hs.23960 


cyclin B1 


134116 


M27691 


R84694 


Hs.79194 


cAMP responsive element binding protein 1 


133999 


M28213 


AA535244 


Hs.78305 


RAB2, member RAS oncogene family 


130174 


M29550 


M29551 


Hs.1 51 531 


protein phosphatase 3 (fonnerly 2B). catalytic subunit, beta isofomi (caldneurin A beta) 


129963 


M29971 


M29971 


Hs.1 384 


O-e-methytguanine-DNA mefiiyttransferase 


132983 


M30269 


M30269 


Hs.62041 


nidogen (enactin) 


133900 


M31158 


M31158 


Hs.77439 


protein kinase, cAMP-dependent, regulatory, type II. beta 


101543 


M31166 


M31166 


Hs.2050 


pentaxin-related gene, rapidly induced by 1L>1 beta 


101545 


M31210 


BE246154 


Hs.154210 


endothelial differentiation, sphingolipkl 6-protein-coup1ed receptor, 1 


101620 


M55420 


S55271 


Hs.247930 


Epsilon , IgE 


134691 


M59979 


AW382987 


Hs.88474 


prostaglandin-endoperoxkte synthase 1 (prostaglandin G/H synthase and cydooxygenase) 


133595 


M62810 


AA393273 


Hs.75133 


transcription factor 6-Gke 1 (ntftochondrial transcriptton factor 1-13(8) 


130425 


M63d38 


AA243383 


Hs.155530 


Interferon, gamma-inducible protein 16 


101700 


M64710 


D90337 


Hs.247916 


natriuretic peptide precursor C 


101714 


M68874 


M68874 


Hs.211587 


phosphoiipase A2. group iVA (cytosoHc. calcium-dependenQ 


134246 


M74524 


D28459 


Hs.80612 


ubiquitin-conjugating enzyme E2A (RAD6 homolog) 


101760 


M80254 


M60254 


Hs.173125 


peptidyiproiyi isomerase F (cydophirm F) 


133948 


M81780^cds3 


X59960 


Hs.77813 


sphingomyelin phosphodiesterase 1, add lysosomal (add sphingomyelinase) 


101791 


M83822 


M83822 


Hs.62354 


ceO division cyde 4-like 


101812 


M86934 


BE439894 


Hs78991 


ONA segment numerous copies, expressed probes (GS1 gene) 


101813 


M87338 


NM_002914 


Hs.1 39226 


replication fador C (activator 1) 2 (40kO) 


133396 


M96326_ma1 


M96326 


HsJ2885 


azuroddin 1 (cationic antimicrobial protein 37) 


135152 


M96954 


M96954 


Hs.182741 


TIA1 cytotoxic granule-assodated RNA-binding protein-like 1 


129026 


M98833 


AL1 20297 • 


Hs.108043 


Friend leukemia vims integratton 1 


101901 


S66793 


H38026 


Hs.308 


arrestin 3, retinal (X-anestin) 


134831 


S72370 


AA853479 


Hs.89890 


pymvate carboxylase 


134039 


S78569 


NIVL002290 


Hs.78672 


laminin, alpha 4 
lysosomal 


134395 


S79873 


AA456539 


Hs.8262 


101975 


S83325 


AA079717 


Hs.263664 


aspartate beta-hydroxylase 


101977 


S83364 


AF1 12213 


Hs.184062 


putative RabSnnterading protein 


101978 S83365 


BE561610 


Hs.5809 


putative transmembrane protein; homolog of yeast Golgi membrane protein Yif1p (Yip1 p- 


interacting {actor) 
101998 U01212 


U01212 


Hs.248153 


olfadory marker protein 


102003 


U01922 


U01922 


Hs.1 25565 


transiocase of inner mitochondrial membrane 8 (yeast) homokig A 


102007 


U02556 


U02556 


Hs75307 


t-complex-associated-testis-expressed 14ike 


102009 


U02680 


BE245149 


Hs.82643 


protein tyrosine kinase 9 


416658 


U03272 


U03272 


Hs.79432 


fibrillin 2 (congenital contradural arachnodactyly) 


132951 


U04209 


AW821182 


Hs.61418 


microfibrillar-assodated protein 1 


135389 


U05237 


U05237 


Hs.99872 


fetal Alzheimer antigen 


102048 


U07225 


U07225 


Hs.339 


purinergic receptor P2Y. G-protein coupled, 2 


130145 


U07620 




ns.ioiuol 


mitogen-activated protein kinase 10 


303153 


U09759 


U09759 


HSJ246857 


mitogen-activated protein kinase 9 


420269 


U09820 


U72937 


Hs.96264 


alpha thalassemia/mentat retardation syndrome X-llnked (f^54^S. oemvisiae) homok)g)' 


102095 


U11313 


U11313 


Hs.75760 


sterol carrier protein 2 


102123 


U14518 


NM_001809 


Hs.1594 


centromere protein A (17kD) 


102126 


U14575 


AW950870 


Hs.78961 


protein phosphatase 1 . regulatory (ffihibitor) subunit 8 


102133 


U15173 


AU076845 


Hs.155596 


BCL2yadenovinis£1B 19kDHnteracting protein 2 


102139 


U15932 


NM_004419 


Hs2128 


dual spedfidty phosphatase 5 


102162 


U18291 


AA45Q274 


HS.1S92 


CDCie^cell division cyde 16. S. cerevlsiae. homolog) 
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1Q2164 U1B300 


NM_000107 


Hs.77602 


427653 U18383 


AA159001 


Hs.160069 


131817 U20536 


U20536 


Hs.3280 


102200 U21551 


AA232362 


Hs.157205 


102210 U23028 


BE619413 


Hs.2437 


102214 U23752 


U23752 


Hs.32954 


132811 U25435 


U25435 


Hs.57419 


131319 U25997 


NM^003155 


Hs.25590 


102256 U28251^cds2 


U28251 


Hs.53237 


132316 U28B31 


U28B31 


Hs,44566 


102269 U30245 


U30245 




exonl. 






134365 li32315 


AA568908 


Hs.82240 


102293 U32439 


AF090116 


Hs.79348 


102298 U32849 


AA382169 


Hs.54483 


102325 U35139 


Ald15867 


Hs.50130 


302344 U36764 


BE303044 


Hs.192023 


102361 U39400 


AA223616 


Hs.75859 


102367 U39657 


039656 


Hs.1 18825 


102388 U41344 


AA362907 


HSJ6494 


102394 U41766 


NM_003816 


Hs.2442 


129829 U41613 


AF010258 


Hs.127428 


102251 U41815 


NIVL004398 


Hs.41706 


102409 U43266 


BE300330 


Hs.1 18725 


133746 U44378 


AW410035 


Hs.75862 


102423 U44754 


Z47542 


Hs,179312 


132828 U47011 cxls1 


AB014615 


Hs.57710 


130441 U47077 


U63630 


Hs.1 55637 


102450 U48251 


U48251 


Hs.75871 


129350 U50535 


U50535 


Hs.1 10630 


102534 U56833 


U96759 


Hs.198307 


130457 U58091 


AB014595 


Hs.155976 


135065 U58837 


AA019401 


Hs.93909 


102560 U59289 


R97457 


Hs.63984 


102567 U59863 


U63830 


Hs.146847 


134305 U67122 


U61397 


Hs.81424 


102636 U67319 


U67319 


Hs.9216 


132736 U68019 


AW081883 


Hs.288261 


sapiens mad protein homotog (hMAD-3) niRNA 


133070 U69611 


U92649 


Hs.64311 


102663 U70322 


NWL002270 


Hs.168075 


134660 U73524 


U73524 


Hs.87465 


102735 U79267 


AF111106 


Hs.3382 


102741 U79291 


AW959829 


Hs.83572 


101175 U82671_cds2 


U82671 


Hs.36980 


132164 U84573 


AI752235 


Hs.41270 


102823 U90914 


D85390 


Hs.5057 


102826 U91316 


NM«007274 


Hs.8679 


102831 U91932 


AA26217b 


Hs.80917 


102846 U96131 


BE264974 


Hs.6566 


129777 U97018 


U97018 


Hs.12451 


134161 U97188 


AA634543 


Hs.79440 


134654 V00503 


J03464 


Hs.179573 


302363 X04327 


AW163799 


Hs.198365 


133708 X06389 


AI018666 


Hs.75667 


125701 X07496 


T72104 


Hs.93194 


102915 X07820 


X07820 


Hs.2258 


134656 X14787 


AI750878 


Hs.87409 


413858 X15525.ma1 


NM-001610 


HS75589 


102968 X16398 


AU076611 


Hs.154672 


cydohydrolase 






102971 X16609 


X16609 


Hs.183805 


134037 X53586.nia1 


AI808780 


Hs.227730 


103023 X53793 


AW500470 


Hs.1 17950 


103037 X54936 


BE018302 


Hs.2894 


130282 X55740 


BE245380 


Hs.153952 


134542 X57025 


M14156 


Hs.85112 


128568 X60673 ma1 


H12912 


Hs.274691 


103093 X60708 


S79876 


Hs.44926 


133606 X62048 


U10564 


Hs75188 


129063 X63097 


X63094 


Hs.283822 


424460 X63563 


BE275979 


Hs.296014 


133227 X64037 


AW977263 


Hs.68257 


103181 X69636 


X69836 


Hs.334731 


103184 X69d7d 


U43143 


Hs.74049 


103194 X70649 


NM.004939 


HS78580 



damage-spectfic DNA binding protein 2 (48kD) 

nudear resplratoiy factor 1 

caspase 6. apoptosis-related cysteine protease 

branched chain arronotransferase 1, cytosolic 

euka/yotic translatron initiation facHor 2B. subunit 5 (epsDon. 82J(D) 

SRY (sex detendning region YHwx 1 1 

CCCTC-binding factor (zinc finger protein) 

stanniocatein 1 

ESTs. Highly similar to Z169JflJMAN ZINC FINGER PROTEIN 169 |H^lens] 
KIAA1641 protein 

gbtHtiman myetomonocyfic specific protein (MNDA) gene, 5' flanldng sequence and complete 
syntaxin 3A 

regulator of G-protein signalTing 7 
N-myc (and STAT) inteiactor 
neodin (mouse) homolog 

eukaryotic translation inittetion factor 3, subunit 2 (beta, 36kD) 

chromosome 1 1 open reading frame 4 

mitogen-activated protein kinase kinase 6 

proline arginlne-ridi end leudne-rich repeat protein 

a disintegrin and metalloproteinase domain 9 (meltrin gamma) 

homeo box A9 

DEAD/H (Asp^lu>AlaAsp/His) box polypeptide 10 (RNA helicase) 
selenophosphate synthetase 2 

MAD (mothers against decapentaplegic. DrosophSa) homotog 4 

srnad nuclear RNA activating complex, polypeptide 1. 43kD 

fibroblast groMvlh factor 8 (androgen-induced) 

protein kinase. DNA-activated. catalytic polypeptide 

protein kinase C binding protein 1 

Human BRCA2 region. mRNA sequence CG006 

von Htppel-Lindau binding protein 1 

cuITin 48 

cyclic nucleotide gated channel beta 1 
cadherin 13, H-cadherin (heart) 
TRAF family member-associated NFKB activator 
ubtquitin-like 1 (sentrin) 

caspase 7. apoptosis-retated cysteine protease 

Homo sapiens cDNA: FU23037 fis, done LNG02036, highly similar to HSU68019 Homo 

a disintegrin and metalloproteinase domain 17 (tumor necmsis fector. a4>ha, converting enzyme) 

karyopherin (imporfin) beta 2 

ATP/GTP-bindlng protein 

protein phosphatase 4. regulatory subunit 1 

hypothetical protein MGC14433 

melanoma antigen, family A, 2 

procollagen-lystne, 2-oxoglutarate S-dbxygenase (lysine hydroxytase) 2 
carboxypeptidase D 

cytosolic acyl coenzyme A thioester hydrolase 
adaptor-reiated protein complex 3. sigma 1 subunit 
thyroid honrtone receptor interactor 1 3 
echinodemrt microtubule-assodated protein-like 
IGF-ll mRNA-binding protein 3 
collagen, type I. alpha 2 
2.3-bisphosphoglycerate mutase 
synaptophysln 
apollpoproteln A-4 

matrix metaltoproteinase 10 (stromelysin 2) 

tturomt)OSpondin 1 

add phosphatase 2, lysosomal 

metr^ne tetrahydrofblate dehydrogenase (NAO dependenQ. mettienyltetFahydrofolate 

ankyrini.eryttuocytic 
Integrin, alpha 6 

multifundional polypeptide similar to SAICAR synthetase and AIR carixucylase 

placental growth factor, vascular endothelial growtti factor-related protein 

5' nudeotidase(CD73) 

insuDn-like growth factor 1 (somatomedin C) 

adenylate kinase 3 

dipeptkiyfpeptidase IV (CD26. adenosine deaminase complexing protein 2) 

weel-i'(S. pombe) homolog 

Rhesus blood group. D antigen 

polymerase (RNA) I) (DNA directed) polypeptide B (140kD) 

geneml transcription factor IIF. polypeptide 1 (74kD subunit) 

Homo sapiens, clone IMAGE:3448306, mRNA. partial cds 

fms-related fyrosine Kinase 4 

DEAD/H (Asp^hMla-Asp/His) box polypeptide 1 
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103208 X72841 AW411340 Hs.31314 retinoblastoma-binding protein 7 

129698 X74987 BE242144 Hs,12013 ATP-binding cassette, sub-family E<OABP), member 1 

131486 X83107 F06972 Hs.27372 BMX non-receptor tyrosine kinase 

130729 X84194 AI963747 Hs.18573 acylphosphatase 1, erythrocyte (common) type 

103334 X85753 Nl\/L001260 Hs.25283 cydin-dependent kinase 8 

132645 X87870 A1654712 Hs.54424 hepatocyte nuclear factor 4, alpha 

135094 X89066 Nl\/L003304 Hs^50687 transient receptor potental charniell 

103352 X89398 cds2 H09366 Hs.78853 uracB-DNA glycosylase 

103353 X89399 X89399 Hs.1 19274 RAS p21 protein acfivator (GTPase acfivafing protein) 3 (lns{1.3.4^)P4-binding protein) 
132173 X89426 X89426 Hs.41716 endothelial cell-spedfic nmteouJe 1 

103371 X91247 X91247 Hs.13046 thtoredoxin reductase 1 

131584 X91648 AA598509 Hs.29117 purinenlch element binding protein A 

103376 X92098 Ali)36166 Hs.323378 coated vesicle membrane protein 

103378 X92110 AL119690 Hs.153618 HCX3VIIH protein 

128510 X94703 X94703 Hs.296371 RAB28p member RAS oncogene family 

103410 XgSSOa AA158294 Hs.334879 DRI-assodated protein 1 (negative cofactor 2 alpha) 

133490 X97230J AF022044 Hs.274601 kilter cell immunoglobulin-like receptor, three domains, long cytoplasmk; tail, 1 

103438 X98263 AW175781 Hs.152720 M-phase phosphoprotein 6 

103440 X98296 X98296 Hs.77578 ubiquitin specific protease 9, X chromosome (Drosophila fat facets related) 

103452 X99584 NM_006936 Hs.851 19 SMT3 (suppressor of mif two 3, yeast) homolog 1 

amyloid beta (A4) precursor protein (protease nexin-lt> Alzheimer disease) 
Ric (Drosophila)-ltke. expressed In many tissues 

Y07759 Hs.170157 myosin VA (heavy polypeptide 12. myoxin) 

NM 007048 Hs.284283 butyrophiOii, subfamily 3, meiriberAI . 

BE386490 Hs.279663 Pirin 

AW408009 Hs.22580 alkylglycerone phosphate synthase 

Y09858 Hs.82577 spindlin-like 

Nr\/L002267 Hs.3886 karyopherin alpha 3 (impoitin alpha 4) 

103540 211559 NM_002197 Hs.154721 aconitase 1, soluble 

133152 211695 211695 Hs.324473 niitogen-activated protein kinase 1 

103548 215005 215005 Hs.75573 centromere protein E (31 2kD) 

103612 246261 BE336654 Hs.70937 H3histDne family, member A 

129092 AA01 1243 s D56365 Hs.63525 poly(rC)-binding protein 2 

103692 AA018418 AW137912 Hs.227583 Homo sapiens chromosome X map Xpll^ L4ype caldum chaiiriel alpha-1 subunit 



133536 Y00264 
135185 Y07566 
118523 Y07759 
134662 Y07827 

132083 Y07867 
103500 Y09443 
134389 Y09858 

132084 Y12394 



W25797.comp Hs.177486 
AW404908 Hs.96038 



35 (CACNA1F) gene, complete cds; HSP27 pseudogene, complete sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete ods 



103695 AA018758 
129796 AA0188a4 
132258 AA031993 
132683 AA044217 
131887 AA046548 
member 1 

103723 AA057447_s 
453368 AA058376 
133260 AA083572 
AA085696 
AA088744 
AA089688 
AA091284 
AA092700 
[C.elegans] 
135289 AA092968 
AA094800 
AA100219 
AA1 14885 
AA129547 
AA133016 
AA149507 
129863 AA151005 
103850 AA1S7101 
103855 AA195179JS 
322026 AA203138 
135300 AA203645 
103861 AA206236 
130634 AA227621 
IC.elegans] 
447735 AA248283 
103909 AA249611 
131236 AA282640 
134060 AA287199 
129013 AA313990 
129435 AA314256 
103988 AA314389 
104000 AA324364 
425284 AA329211^s 
128629 AA399187 
133281 AA421079 



103765 
103766 
103767 
132051 
103773 



132729 
103794 
131471 
134319 
103807 
119159 



AW207152 Hs.186600 ESTs 

BE218319 Hs.5807 GTPase Rab14 

AA306325 Hs.431 1 SUMO-1 activating enzyme subunit 2 

BE264633 Hs.143638 WD repeat domain 4 

W17084 Hs.332848 SWl/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e. 

BE274312 Hs.214783 Homo sapiens cDNA aj14041 fis, clone HEMBA1005780 

W20296 Hs.288178 Homo sapiens cDNA FU1 1968 fis, clone HEMBB1001133 

AA403045 Hs.6906 Homo sapiens cDNA: FU23197 fis. clone REC00917 

AA085696 Hs.169600 KIAA0826 protein 

A1920783 Hs.191435 ESTs 

BE244667 Hs.296155 CGMOO protein 

AA393968 Hs.180145 HSPC030 protein 

AI219323 Hs.101077 ESTs, Weakly similar to T22383hypolhetkaI protein F47G9.4- Caenorhabdniselegans 

AW372569 Hs.9788 hypothetical protein MGC10924 similar to Nedd4 WW-binding protein 5 

AW970843 Hs.55682 eukaryotic translation initiation factor 3. subunit 7-<z€ta. 66/67kD) 

AF244135 Hs.30670 hepatocellular caroinoma-assodated antigen 66 

AA164842 Hs.192619 KIAA1600 protein 

BE304999 Hs.75653 fumarate t^iatase 

AW958264 Hs.103832 similar to yeast Up{3. variants 

AF142419 Hs.15020 homotog of mouse quaking QKI (KH domain RNA binding protein) 

BE379765 Hs.129872 spemi associated anfigen 9 

AA1S7101 Hs^13194 hypothetk^l protein MGC10895 

W02363 Hs.302267 hypothetical protein FU10330 

AW024973 Hs.283675 NPD009 protein 

AA142922 Hs^78626 Arg/AbMnterading protein ArgBP2 

AA206236 Hs.4944 hypothetical protein FU12783 

AI769067 Hs.127824 ESTs, Weakly similar to T28770 hypoBietkal prote'm W03D2.1 - Caenoihabditis elegans 

AA775268 Hs.6127 Homo sapiens cDNA: FU23020 fis, ctone LNG00943 

AA24961 1 Hs.47438 SH3 domain binding glutamic add-rich protein 

AF0431 17 Hs^4594 ubtquitination factor E4B (homologous to yeast UFD2) 

D42039 Hs78871 mesodemn development candidate 2 

AA371 156 Hs.107942 DKF2P564M1 12 protein 

AF151852 Hs.111449 CGl-94 protein 

AA314389 Hs.42500 ADP-ribosylation factor-like S 

AI146527 Hs.80475 polymerase (RNA) II (DMA directed) polypeptkie J<13.3kO) 

AF1 55568 Hs.155489 NSI-assodated protein 1 

AL096748 HS.1027Q8 Dra^434A043 protein 
AK001601 Hs.69594 high-mbblTi^ group 2aA 
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Hs.143640 ESTs. Weakly similar to hyperpolarizafioivacGvated cyclic nudeotide^ted channel hKCN2 

Ras-GTPase^dfvating preteln SH3-doma{n4)inding protein 
KIAA0251 protein 

Homo sapiens mRNA; cDNA DKFZp586E1624 (from done DKFZp586E1624) 
nudearfectorl/A 

Homo sapiens mRNA; cONA DKFZp586L0120 (from done DKFZp586U)120) 
K1AA0328 protein 

Homo sapiens mRNA; cONA DKFZp566Ja720 (from done DKFZp586J0720) 
semm-indudbte kinase 
hypothetkal proten FLJ22393 
CGM21 protein 

Homo sapiens done H63 unknown mRNA 
EST 

ooproporphyrinogen oxidase (coproporphyria, harderopoiphyria) 
KIAA0014 gene product 
ESTs 

EGF-TM7-latrophiIin-relatBd protein v 
KIAA1036 protein 
phosphoglycerate kinase 1 
platelet-activating factor acetylhydrolase 2 (40kD) 
msh (Drosophila) homeo box homotog 2 
cAMP responsive element binding protein-IIke 2 
ESTs 

Homo sapiens cDNA: FLJ21409 fis. done COL03924 
ESTs 
ESTs 
ESTs 

Kruppel-like factor 8 
integrin, alpha 8 

gastric intrinsic factor (vitamin B synthesis) 
threonyl-tRNA synthetase 

Homo sapiens, done ]MAGE:4299322. mRNA. partial cds 
protocadherin 17 
Bardet-Biedl syndrome 2 
zinc finger protein 198 
inhibitor of growth bmily, member 3 
hypothetical protein FU10814 

gb7h26b09.r1 Scares placenta Nb2HP Homo sapiens cDNA done IIVIAGE:130841 S". mRNA 

ESTs, Weakly similar to p40 (H^piens] 
hypothetkal protein DKI=2:p7611141 
hypothetical protein AF14Q225 
ESTs 

progestin induced protein 
cystelne-rich motor neuron 1 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564D016 (from done DKFZp564C)016) 
ESTs 

ESTs. Weakly similar to 138022 hypothetteal protein [H.sapiens) 
ESTs 

K1AA0955 protein 

gb:ze97d1 1 ^1 SoaresJetaLhearLNbHH19W Homo sapiens cDNA done 1MAGE:366933 3" 
similar to contains Alu repetitive element;. mRNA sequence. 



104104 


AA422029 


AA422029 


Hs.143640 


[H^sapiens] 






108154 


AA425230 


NM_005754 


Hs.220689 


132091 


AA447052 


AW954243 


Hs.170218 


135073 


AA452000 


W55956 


Hs.94030 


131367 


AA456687 


A1750575 


Hs.173933 


129593 


AA487015^S 


AI338247 


Hs.98314 


135268 


AB002326 


R41179 


Hs.97393 


133505 


C01527 


AI630124 


Hs.324504 


132064 


C01714 


AA121098 


Hs.3838 


134393 


C01811J 


W52642 


Hs.8261 


131427 


C02352_S 


AF151879 


Hs^oe 


133435 


C02375 


AI929357 


Hs.323986 


104282 


C14448 


C14448 


Hs.332338 


134827 


D16611_s 


BE314037 


Hs.89866 


130443 


D25216 


D25216 


Hs.155650 


131742 


D31352 


AAg61420 


Hs.31433 


132837 


D58024.S 


AA370362 


Hs.57958 


130377 


D80897 


NM_014909 


Hs.155182 


104334 


D82614 


D82614 


Hs.78771 


134593 


D87845 


NNL000437 


Hs.234392 


134731 


D89377J 


D89377 


Hs.89404 


129913 


H06583 


NIVL001310 


Hs.13313 


131670 


H40732 


H03514 


Hs.10130 


104394 


H46617 


AA1 29551 


Hs.172129 


104402 


H56731 


H56731 


Hs.132956 


129781 


H75570 


AA306090 


Hs.124707 


129077 


H78886 


N74724 


Hs.108479 


104417 


H81241 


AI819448 


Hs.320861 


134927 


L36531 


L36531 


Hs.91296 


129280 


M63154 


M63154 


Hs.110014 


134498 


M63180 


AW246273 


Hs.84131 


104460 


M91504 


AW955705 


Hs.62604 


104488 


N56191 


N55191 


Hs.106511 


131248 


N78483 


At038989 


Hs.332633 


129214 


N79268 


AL044335 


Hs.109526 


130017 


R146S2 


AK000096 


Hs.143198 


104530 


R20459 


AK001676 


Hs.12457 


104534 


R22303 


R22303 




sequence. 






104544 


R33779 


A1091173 


Hs.222362 


133328 


R36553 


AW452738 


Hs.265327 


104567 


R64534 


AA040620 


Hs.5672 


128562 


R66475 


AA923382 


Hs.101490 


129575 


R70621 


F08282 


Hs.278428 


130776 


R79355 


AF167705 


Hs.19280 


104599 


R84933 


AW815036 


Hs,151251 


104660 


RC_AA007160 


BE298665 


Hs.14846 


104667 


RC.AA007234.S AI239923 


Hs.30098 


104718 


RC^AA018409 


AI143020 


Hs.36250 


104764 


RC.AA025351 


AI039243 


Hs.278585 


104786 


RC.AA027168 


AA027167 


Hs,10031 


104787 


RC_AA027317 


AA027317 





134079 RC_AA029423 
104804 RC_AA031357 
104865 RC_AA045136 
130828 RC^AA053400 
104907 RC_AA055829 
WARNING ENTRY [H.sapiens] 
104943 RCJ\A065217 AF072873 
105013 RCJ^116054 
105024 RCJ^126311 
132592 RCJVA129390 
105038 RC_AA130273 
105077 RC^AA142919 
105096 RC_AA150205 
129215 RCJ\A176867 
105169 RC^AA180321 
132796 RC_AA180487 
130401 RCJ\A187634 
105200 RCJ^195399 
130114 RC.AA234717 
105330 RCJ\A234743 
105337 RCJ\A234957 
129385 RCJ^A235604 



AK001751 
AI858702 
T79340 
AW631469 
AA055829 



H63789 
AA126311 
AW803564 
AW503733 
W55946 
AL042506 
AB040930 
BE245294 
NIVL006283 
BE396283 
AA328102 
AA233393 
AW338625 
A1468789 
AA172106 



Hs.171835 hypothetical protein FU10889 

Hs.31803 ESTs. Weakly similar to N>WASP [H.sapiens] 

Hs.22575 B-oell CLUIymphoma 6, member B (zinc finger protein) 

Hs.203213 ESTs 

Hs.195701 ESTs. Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.1 14218 frizzled (Drosophila) homok)g 6 

Hs.296288 ESTs. Weakly similar to KIAA0638 protein [H.sapiens] 

Hs 9879 ESTs 

Hs!288850 Homo sapiens cDNA: FLJ2252a fis. done HRC12825 

Hs.9414 KIAA1488 protein 

Hs.234863 Homo sapiens cDNA FU1^2 fis. done HEMBB1002492 

Hs.21599 KnippeMike factor 7 (ubiquitous) 

Hs.126085 KIAA1497 protein 

Hs.180789 S164 protein 

Hs. 1731 59 transfonning, addic coHed-coil containing protein 1 

Hs.173987 eukaryotfc translation tniliatton factor 3, subunit 1 (alpha. 35kD) 

Hs.24641 cytoskeleton associated protein 2 

Hs.14992 hypothetical protein FU11151 

Hs^120 ESTs 

Hs.23200 myotubularih related protein 1 

HS.1109S0 Rag 0 protein 
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105376 RC AA236559 
105397 RC AA242868 
131982 RC AA251776 
131991 RC_AA251909 
128658 RC_AA252672_s 
105489 RC AA256157 
105508 RC.AA256680 
105539 RC_AA258873 
135172 RC AA262727 
131569 RC^AA281451 
132542 RC AA281545 
105643 RC_AA282069 
105659 RC_AA283044 
105666 RC.AA283930 
105674 RCJ^84755 
105709 RCJ^A291268 
105722 RCJVA291927 
105765 RCJW1343514 
115951 RCJ^398109 
105962 RCJ\A405737 
105985 RCJ^406610 
gb:X02067 

106008 RC^AA411465 
RC AA416886 
RC_AA424013 
RC AA424148 
RC AA424558 
RC^AA424961_s 
RCJ\A425367 
RC AA425921 



131216 
134222 
113689 
106141 
130839 
106157 
130777 
receptor 
130561 RC_AA426220 
106196 RC.AA427735 
WARNING 

131878 RC_AA430673 
RC^AA432248 
RC_AA435896 
RCJ^A436705 
RC.AA446561 
RC_AA44823B 
133442 RC_AA448688 
439608 RCJ^449756 
RC^AA450303 
RC_AA452411 
RC_AA454566 
RC_AA454657 
RC_AA456437 
RC_AA456646 
RC_AA456826 
RC AA456981 
CONTAMINATION 
134655 RC_AA458959 
member 1 

106636 RC_AA459950 
RC_AA460449 
RC^AA463910 
RC AA464603 
RC_AA464606 
106717 RC^AA465093 
131775 RCJVA465692 
RCJ^476473 
RC_AA478109 
RC.AA478474 
RC.AA480889 
RC_AA485223 
RC_AA485254 
RC_AA486183 
RC_AA496936 
WARNING 

107001 RC_AA598589 
RC^AA598831J 
RC_AA600150 
RC.AA608545 
RC_AA609210 
RC_AA610108 



133200 
106302 
106328 
450534 
106423 



106477 
106503 
446999 
106543 
130010 
106589 
106593 
106596 



106654 
131353 
106707 
131710 



106747 
106773 
105781 
106817 
106846 
106848 
106856 
418699 



130638 
107054 
107059 
107080 
107115 



AW994032 Hs.8768 hypothetical protein FU10849 

AA814807 Hs.7395 hypothetical protein FU23182 

AKQD0046 Hs.267448 hypothetical protein FLJ20039 

AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 

BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis (SaocharomyoesHiKe 2 

AA256157 Hs.241 15 Homo sapiens cDNA FU14178 fis. clone NT2RP2003339 

AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (from done DKFZp564H1916) 

AB0408B4 Hs.109694 KIAA1451 protein 

AB028956 Hs.l2144 KIAA1033 protein 

AL389951 Hs.271623 nudeoporin 50kD 

AL137751 Hs.263671 Homo sapiens mRNA; cONA DKFZp434l0812 (from done DKFZp434l0812}; partial ods 

BE621719 Hs.173802 KIAA0603 gene product 

AA283044 Hs.25625 hypothetical protein FU1 1323 

AA426234 Hs.34906 ESTs. Weakly similar to T17210 hypothetical protein DKFZp434N041.1 [H.sapiensl 

AI609530 Hs279789 histone deacetylase 3 

AI928962 Hs^6761 DKFZP586L0724 protein 

Aig22821 Hs.32433 ESTs 

AA299688 Hs^4183 ESTs 

BE546245 Hs.301048 sec13-tlke protein 

AW880358 Hs.339808 hypothetical protein FU10120 

AA406610 gb:zv15b10.s1 Soares.NhHMPu_S1 Homo sapiens cDNA done IMAGE:753691 3* similar to 

AB033888 Hs.861 9 SRY (sex detemiining region Y)-box 18 

AI815486 Hs.243901 Homo sapiens cDNA FU20738 fis. done HEP08257 

AW855861 Hs.8025 Homo sapiens done 23767 and 23782 mRNA sequences 

AB037850 Hs.l6621 DKFZP434I116 protein 

AF031463 Hs.9302 phosdudn-like 

A6011 169 Hs.20141 similar to S. cerevisiae SSM4 

W37943 Hs.34892 KIAA1323 protein 

AW135049 Hs.285416 Homo sapiens cDNA FU10843 fis. done NT2RP20(^53. h^hly similar to Homo sapens i-l 

AB011095 Hs.16032 KIAA0523 protein 

AA525993 Hs.l73699 ESTs. Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA083764 Hs.6l01 hypothetical protein MGC3178 

AB037715 Hs.183639 hypothefical protein FU10210 

AA398859 Hs.18397 hypothetical protein FU23221 

AU)79558 Hs.28020 K1AA0766 gene product 

AI57D189 Hs.25132 KIAA0470 gene product 

ABQ20722 Hs.16714 Rho guanine exchai^ fedor (GEF) 15 

AL137663 Hs.7378 Homo sapiens mRNA: cDNA DKFZp434G227 (from done DKFZp434G227) 

AW864696 Hs.301732 hypothetical protein MGC5306 

R23324 Hs.41693 DnaJ (Hsp40) homolog. subfamily B. member 4 

AB033042 Hs.29679 cofador required for Spl transcrqitional activation, subunit 3 (130kD) 

AA151520 Hs.334822 hypothetical proton MGC4485 

AA676939 Hs.69285 neuropilin 1 

AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 

AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 lis. done HEMBA1001702 

AW296451 Hs.24605 ESTs 

AA452379 Hs.293552 ESTs. Moderately similar to ALU7 HUMAN ALU SUBFAMILY SO SEQUENCE 



107130 RCJM^0582 



AF265208 Hs.l 23090 SWI/SNF related, matrix assodated. aclin dependent regulator of chromatin, subfamily f , 

AW958037 Hs.286 - nliosomal protein L4 

AW075485 Hs.286049 phosphoserine aminotransferase 

AW754182 gb:RC2-CT0321-131199^11-c01 CT0321 Homo sapiens cONA. mRNA sequence 

AK000566 Hs.98135 hypoflietical protein FU20569 

NM_015368 Hs.30985 pannexin 1 

AA600357 Hs^39489 TIA1 cytotoxic granule-associated RNA-bindmg protein 

AB014548 H5.31921 K1AA0648 protein 

NM_007118 HS.1719S7 triple functional domain (PTPRF interacting) 

AA478109 Hs.188833 ESTs 

AA330310 Hs.24181 ESTs 

D61216 Hs.18672 ESTs 

AB037744 Hs.34892 K1AA1323 protein 

AA449014 Hs.121025 chromosome 1 1 open reading frame 5 

W58353 H5.285123 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 2005779 

BE539639 Hs.173030 ESTs. Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX^QUENCE CONTAMINATION 

A1926520 Hs.31016 putative DNA binding protein 

AW021276 Hs.17121 ESTs 

AI076459 Hs.15978 KIAAl 272 protein 

BE614410 Hs.23044 RAD51 (S. cerevisiae) homotog <E con RecA homotog) 

AL122043 Hs.19221 hypothetkal protein DW=Zp58eG1424 

BE379623 Hs.27693 peptidytprolyl isomerase (cydophain)-like 1 

AB033106 Hs.12913 KIAA1280 protein 
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107156 


RC AA621239 




Lie. QCCQ 


107174 


RC.AA621714 


RP4997R9 
DC l^/OZ 




130621 


RCJVA621718 


AWoijUo/ 


Lit* ICQflQ 


10719D 


RC D19673 


AAooo4D1 


nS.5iU3 


132626 


RC.D25755_s 


A\AfCAil7QO 




107217 


RC D51095 


Al n0f\99C 


ns.oooDl 


131610 RC.D6027^i 




ns./w/o 


129604 T08879 


ArUooooD 


nS. 1 lOsU 


107295 


T34527 


AA1 000/9 


nS.oUi/U 


(GalNAc-TI) 






107299 


T40327.S 


OCjifJ'tO/ 


n5.oU0Dl 


107315 


T62771_s 


AA31oZ4i 


r1S.9Uo91 


107316 


T63174.S ' 


T63174 


Hs.1 93700 


107328 


T83444 


AW959o91 


nS.7o591 


107334 


T93M1 


T93597 


Hs.1 87429 


134715 


U48263 


U48263 


ns.89040 


128636 


U49065 


U49065 


Hs.102865 


129938 


U79300 


AW003OOO 


Hs. 135587 


107375 


U88573 


BE011845 


nS.2510o4 


130074 


U93867 


AL038596 


Hs.250745 


107387 


W01094 


Doo9o3 


Ms. 11oo93 


132036 


W01568 


ALi 57433 


Lin Q'nnc 


107426 


W26853 


W26853 


nS.291U03 


113857 


W27179 


AWZ43i5o 


IJt« COOT 


135388 


W27965 


WZ7900 


|_|_ QOQCC 


130419 


W36280.S 


Ar037448 


nS. ioo4o9 


107469 


W47063 


W47063 


Hs.94668 


132616 


W79080 


BE262677 


Hs^83558 


107506 


W88550 


AB0289o1 


Hs.8021 


132358 


X60486 


NIVL003542 


PIS .404/3 


107522 


X78931^s 


X78931 


nS.99971 


125827 


214077.S 


NM_003403 


Hs.97496 


107582 


RC_AA002147 


AA002147 


Hs.59952 


107609 


RCJ^004711 


R75654 


Hs.1 64797 


107661 


RC_AA010383 


AA010383 


U» CAQfid 

nS.Duoo9 


107714 


RCJ^A015761 


AA015761 


Hs.60642 


107775 


RCJWV018772 


AW008846 


ns.60o57 


107832 


RC_AA021473.r AA021473 




sequence. 






107859 


RC M024835 


AW732573 


Hs. 47584 


124337 


RC_AA025858 


N23541 


190 1 


107914 


RC AA027229 


AA027229 


Hs.61329 


[C.elegans) 






107935 


RC_AA029428 


AA029428 


Hs.61555 


116262 


RC AA035143 


AI936442 


Hs.59838 


131461 


RC_AA035237 


AA992841 


Hs,27263 


108007 


RC_AA039347 


AA039347 


Hs.61916 


108029 RCJ^040740 


AA04a740 


Hs.62007 


108040 


RC^AA041551 


AL121031 


Hs.159971 



member 1 

108084 RC AA045513 AA058944 

108088 RC_AA045745 AA045745 

108168 RC.AA055348 AI453137 

130719 RC.AAa56582_s AA679262 

108189 RC_AA056697 AW376061 

108190 RC_M056746 



108203 
108216 
108217 
108245 
108277 
mRNA 
108280 
108309 
133739 
108340 
108403 



RCJWV057678 
RC_AA058681 
RC_AA058686 
RCJ\A062840 
RC.AA064859 



AA0S8746 
AW847814 
AA524743 
AA058886 
BE410285 
AA064859 



RC_AA065059 AA065069 
RC_AA069923 AA069818 
RC_AAD70799.s BE536554 
RC^AA070815 AA069820 
RC_AA075374 
3\ mRNA sequence. 
108427 RC.AA076382 
3'. mRNA sequence. 
108435 RC_AA078787 
108439 RC^AA078986 
3', mRNA sequence. 
108465 RCJ\A079393 
108469 RC_AA079487 
sequence 



AA075374 

AA076382 

T82427 
AA078986 



AA079393 
AA079487 



Hs.1 16602 

Hs.62886 

Hs.63176 

Hs.14235 

Hs.63335 

Hs.63338 

H5^89005 

Hs.44883 

Hs.62588 

Hs.89545 



Hs.278270 
Hs.180g09 



Hs.194101 
Hs.3462 



prograntmed oeO death 6-intefacfing protein 
ESTs 

LUC7(S.ceievisiae)-Gke 
ESTs 

hypothetical protein FU11011 
DKFZP586E1621 protein 
scavenger receptor with C-type lecfin 
cath^InF 

UDP-N-aoetykalpha-(Vgalactosamine:polypepfide N-aoetytgalactosaminyttransferase 1 

hypothetical protein M6C4606 
nudeophosminAiudeoplasmin 3 

Homo sapiens mRNA; cDNA DKFZp586l0324 (from done DKFZp586l0324) 

KIAA0887 protein 

ESTs 

prepronodceptin 

interteukin 1 receptor-like 2 

Human done 23629 mRNA sequence 

h*Qh-mob&ity group (nonhistone chromosomal) protein 14 

polymerase (RNA) 111 (DNA directed) (62kD) 

Melanoma associated gene 

hypothetical protein D}<FZp434E2220 

hypothetical protein MGC4707 

DKFZP564A2416 protein 

epimorphin 

NSI-assodated protein 1 
ESTs 

hypothetical protein PR01855 

KiAA1058 protein 

H4 histone family, member G 

zinc finger protein 272 

YY1 transcription factor 

EST 

hypothetical protein FU13693 

ESTs 

ESTs 

ESTs 

gb:ze66c11.s1 Scares retina N2b4HR Homo sapiens cDNA done IMAGE:363956 3*. mRNA 

potassium voltage^ated channel, delayed-rectifier, subfamily S. member Z 
Homo sapiens cDNA: FU23582 tis, done LNG13759 

ESTs. Weakly similar to T16370 hypothetical protein F45E12.5 - CaenorhatxJitis elegans 
ESTs 

hypottietical protein FU10808 

KIAA1458 protein 

EST 

ESTs 

SWl/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b. 

Homo sapiens, done IMAGE:4154008. mRNA, partial cds 

ESTs 

ESTs 

hypothetical protein FLJ20008; KIAA1839 protein 

ESTs, Moderately sMar to A46010 X-ltnked retinopathy protein [H.saplens] 

EST 

Homo sapiens cDNA: FU21532 Gs. done COU06049 

ESTs 

ESTs 

proteasome (prosome, macropain) subunit, beta type. 4 

gb:zm50{03.s1 Stratagene fSirobiast (937212) Homo sapiens cDNA done IMAGE:529085 3\ 

gb:zm12e1 1^1 Stratagene pancreas (937208) Homo sapiens cDNA done 3', mRNA sequence 
gb:zm67e03.r1 Stratagene neuroephhelium (937231) Homo sapiens d)NA done 5* similar to 
unactive progesterone receptor, 23 kD 
peroxtredoxin 1 

gb:zm87a01^1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done iMAGE:544872 
gb:zm91g08.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE:545342 
Homo sapiens cDNA: FU20869 fs. done ADKA02377 

gb2m92h01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE.545425 

cytochrome c oxidase subunit Vile 

gb2m97ffl8.s1 Stratagene colon HT29 (937221) Homo sapiens cONA done 3\ mRNA 
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130890 
134585 
130385 
108749 
108807 
108808 



108500 RC AA083207 

108501 RC^AA083256 
gb:M33308 

108533 RC^AA084415 
mRNA 

108562 RC^AA085274 
gb:X15341 

108589 RC.AA088678 
RC^AA100925 
RC^AA101255 
RC^M126474 
RC^AA127017 
RCJVA129968 
RC.AA130240 
108833 RCJ^131866 
107290 RCJ^132039 
108846 RC.AA132983 
108857 RCJ^133250 
131474 RCJ^A133583. 
108894 RC.AA135941 
108941 RC^AA148650 
IMAGE:5672023\ 
108968 RC AA151110 
RC.AA155754 
RC AA156125 
RC AA156289 
RC.AA156997 
RC_AA157291 
RC AA1 57293 
RC_AA164293„ 
RC_AA164676 
RC_AA167375 
RC AA167550 
RC^AA176589 
RC_AA180448 
RC_AA187144. 
RC AA189170. 
RCIaA192757 
RCJ^A205650 
RCJU\233342 
109485 RC„AA233472 
109516 RC^AA234110 
RC_D80981 
RC_F01660 
RC F02206 
RC_F02208 
RC F02544 
RC F03918 
RC_F04258_s 
RC_F04600 
RC_F08998 
RC F09805 
RC^F11115 
RC_H06371 
RC H10995 
RC H11938 
RC_H16568 
RC_H16772 
RC_H18951 
RC.H20859 
RC.H23747 
RC.H38087 
RC.H40331 
110342 RC_H40567 
110395 RC.H46956 
RC.H56640J 
RC.H57154 
RC.H96712 
RC.N20814 
RC.N25249 
RC_N27100 
RC_N39616 
RC_N48982 
RC,N51957 
115062 RC.N52271 
111081 RC.N59435 



108996 
109001 
131183 
109019 
109022 
109023 
109068 
109072 
129021 
130346 
109146 
109172 
131080 
129208 
109222 
109300 
109481 



109537 
109556 
109577 
109578 
109595 
109625 
131983 
109648 
109671 
109699 
109820 
109933 
110014 
110039 
110099 
110107 
110155 
110197 
110223 
110306 
110335 



110511 
110523 
110715 
110754 
130132 
131135 
134263 
110938 
110983 



AA083207 Hs.68270 EST 

AA083256 gb2n08g12.s1 Stratagene hNT neuron <937233) Homo sapiens cDNA ctone 3' similar to 

AA084415 gb:2n06g09.s1 Stratagene hm neuron (937233) Homo sapiens d)NA clone IMAGE:546688 3'. 

AA100796 gb:zn[)26c06.s1 Stratagene pancreas (937208) Homo sapiens cDNA clone 3* slmflar to 

A1732404 Hs.68846 ESTs 

AI907537 Hs J6698 stress-associated endoplasmic reticulum protein 1; ribosome associated membrane protein 4 

D14041 Hs.278573 H-2K binding factor-2 

AW067800 Hs. 155223 stanniocafdn 2 

AA127017 Hs.71052 ESTs 

AI6S2236 Hs.49376 hypothefical protein FU20644 

AA045088 Hs.62738 ESTs 

AF188527 Hs.61661 ESTs. Weakly similar to AF174605 1 F-box protein Fbx25 (H.sapiens] 

W27740 Hs,323780 ESTs 

AL117452 Hs.44155 DKF2P586G1517 protein 

AK001468 Hs.62180 aniliin (Drosophila Scraps homotog), ac&n binding protein 

s L46353 Hs.2726 high-mobility group (nonhistone chromosomai) protein isofonn l-C 

AK001431 Hs.5105 hypothetical protein FU1 0569 

AA148650 gb:zo09e06.s1 Strataigene neuroeiHthelium NT2RA1\A1 937234 Homo sapiens cOt^-done 

A1304870 Hs.188680 ESTs 

AW995610 Hs.332436 EST 

A1056548 Hs.721 16 hypothetical protein FU20992 similar to hedgehog-interacting protein 

AI611807 Hs.285107 * hypothetical protein FU13397 

AA156755 Hs72150 ESTs 

AA157291 Hs.21479 ubinudein 1 

AA157293 Hs72168 ESTs 

f AA164293 Hs.72545 ESTs 

AI7325B5 Hs,22394 hypothetical protein FU10893 

AL044675 Hs.173081 K1AA0530 protein 

H05769 Hs.1 88757 Homo sapiens, done I\/1GC:5564, mRNA. complete cds 

AA176589 Hs.142078 EST 

AA180448 Hs.144300 EST 

s NM_001955 Hs.2271 endothelin 1 

f AI587376 Hs.109441 MSTP033 protein 

AA192833 Hs.333512 similar to rat myomegaBn 

AA418276 Hs.170142 ESTs 

AA878923 Hs^89069 hypothetical protein FU21 016 

BE619092 Hs.28465 Homo sapiens cDNA: FU21869 fis. done HEP02442 

AI471639 HsJ1913 ESTs 

AI858695 Hs.34898 ESTs 

A1925294 Hs.87385 ESTs 

F02206 Hs.296639 Homo sapiens potassium channel subunit (HERG-3) mRNA, connpiete ods 

F02208 Hs.27214 ESTs 

AA078629 Hs.27301 ESTs 

H29490 Hs.22697 ESTs 

AF1 19665 Hs.184011 pyrophosphatase (inorganic) 

H17800 Hs7154 ESTs 

R59210 Hs^6634 ESTs 

H18013 Hs.167483 ESTs 

AW016809 Hs.323795 ESTs 

R52417 Hs,20945 Homo sapiens done 24993 mRNA sequence 

AL109666 Hs7242 Homo sapiens mRNA fuii length insert cDNA done EUR0lti4AGE 35907 

H11938 Hs^1907 histone acetyltransferase 

R44557 Hs.23748 ESTs 

AW151660 Hs.31444 ESTs 

A1559626 Hs.93522 Homo sapiens mRNA for KIAA1647 protein, partial cds 

AW090386 Hs.112278 arrestin. beta 1 

H19836 Hs.31697 ESTs 

H38087 Hs.105509 CTL2gene 

H65490 Hs.18845 ESTs 

H40961 Hs.33008 ESTs 

AA025116 Hs.33333 ESTs 

H56640 H5^14aO ESTs 

AI040384 Hs.19102 ESTs. Weakly simHarto organic anion transporter 1 (H^iens] 

H96712 Hs.269029 ESTs 

AW302200 Hs.6336 KIAA0672 gene product 

U55936 Hs.184376 synaptosomat-assodated protein. 231cD 

NM.016569 Hs^7182 TBX3-iso protein 

AVV973443 Hs.8086 RNA (guanine-7-) melhyltranslerase 

N46982 Hs^8034 Homo sapiens cDNAFU 12924 fis. done NT2RP2004709 

NM-015367 Hs.10267 M1L1 protein 

AA253314 Hs.154103 LIM protein (similar to rat protein iunase C-binding enigma) 
AI146349 Hs.271614 CGl-1 12 protein 
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112033 
130987 
112300 
112513 



111128 RC_N64139 
135244 RC.N66981 
111216 RC_N68640 
437562 RC N89352 
131002 RC^N95226 
111399 RC^R00138 
111514 RC^R07998 
similar to 

130182 RC R08929 
111574 RC_R10307 
111804 RC_R33354 
111831 RC^R36083 
129675 RC.R37938J 
111904 RC.R39330 
sequence 

133868 RC_R40816_s 
RC_R43162.s 
RC^R45698 
RC,R54554 
RC.R68425 
112514 RC.R68568 
112522 RC_R68783 
112540 RC_R70467 
130346 RC.R73565 
129534" RC.R73640 
112597 RC_R78376 
112732 RC_R92453 
131458 RC_T03865 
RC_T03872 
RC_T10072 
RC_T10080 
RC_T10132 
RC_T15343 
RC_T23457 
RC.T23555 
RC.T23670 
RC^T23948 
RC_T33464 
RC_T34413 
RC_T34611 
113095 RC^T40920 
113179 RC.T55182 
RC.T77453 
RC_T84039 
RC_T86458 
RC_T87693 
RC.T89350.S 
RC.T90945 
113559 RCjr90987 
113589 RC.T91863 
113591 RC.T91881 
113619 RC.T93783.S 
113683 RC,T96687 
RC_T96944 
RC_T97307 



112888 
131863 
112911 
132215 
112931 
112984 
112998 
133376 
113026 
113070 
128970 
113074 



113337 
113421 
113454 
113481 
131441 
113557 



113692 
113702 
mRNA 
113717 
113824 



RC^T97764 
RC W48817 
113840 RC W58343 
113844 RC_W59949 
PROTEIN TC10 
113902 RC.W74644 

113904 RC.W74761 

113905 RC.W74802 

113931 RC W81205 

113932 RC^W81237 
131965 RC^W90146J 
114035 RC.W92798 
114106 RC.Z38412 
133593 RC_Z38709 
114161 RC^Z38904 
424949 RC^Z39103 
129059 RC^Z39930J 
128937 RC.739939 
WARNING 

130983 RCJ^4001^.i 



AW505364 Hs.19074 LATS (laige tumor suppressor, Drosophila) homolog 2 

AI834273 Hs.9711 novel prot^ 

AWI 39408 Hs. 152940 ESTs 

ABQ01636 Hs.5683 DEAD/H (Asp^tu-Ala-Asp/Hls) box polypeptide 15 

AL050295 Hs.22039 K1AA0758 protein 

AW270776 Hs.18857 ESTs 

R07998 gb7f16g11^1 Soares fetal liver sp!een 1NFLS Homo sapiens cONA done IMAGE:127076 3' 

BE267033 Hs.192853 ubiqultin-conjugating enzyme E2G 2 (homologous to yeast UBC7) 

AI024145 Hs.188526 ESTs 

AA482478 Hs.1817e5 ESTs 

R36095 Hs.268695 ESTs 

NM_015556 Hs.172180 K1AA0440 protein 

Z41572 gb:HSCZYB122 noonafized intent btain cDNA Homo sapiens cDNA done o-zyb12, mRNA 

AB012193 Hs.183874 cuinn4A 

R49031 Hs.22627 ESTs 

BE613269 Hs.21893 hypothetical proteb DKFZp761NQ824 

H24334 Hs.26125 ESTs 

R68425 Hs.13809 hypothetical protein FU1 0648 

R68568 Hs. 1 83373 src homology 3 domain-containing protein HiP-55 

R68857 Hs.265499 ESTs 

RG9751 gb:yi40a10.s1 Soares placenta Nb2HP Homo sapiens cDNA done 3', mRNA sequence 

H05769 Hs.188757 Homo sapiens, done M6C:5564. mRNA, oonrtplete ods 

AK002126 Hs.11260 hypothetical protein BJ 11264 

R78376 Hs.29733 EST 

R92453 Hs.34590 ESTs 

BE297567 Hs.27047 hypothefical protein FU20392 

AW195317 Hs.107716 hypothefical protein FU22344 

AI656378 Hs.33461 ESTs 

AW732747 Hs.13493 like mouse brain protein E46 

AL035703 Hs.4236 KIAA0478 gene product 

T02966 Hs.167428 ESTs 

T16971 Hs.289014 ESTs, Weakly similar to A43932 mudn 2 precursor, intestinal [H.saptens] 

H 1 1 257 Hs.22968 Homo sapiens done IMAGE:451 939. mRNA sequence 

BE618768 Hs7232 acetyl-Coenzyme A carboxylase alpha 

AA376654 Hs. 183664 eukaryotic translation initiafion factor A gamma. 2 

AB032977 Hs.6298 KIAA1 151 protein 

AI375672 Hs.165028 ESTs 

AK001335 Hs.31 137 protein tyrosine phosphatase, receptor type, E 

AA828380 Hs.126733 ESTs 

BE622021 Hs.152571 ESTs. Highly similar to IGF-ll mRNA-bindIng protein 2 [H.sapiens] 

T77453 Hs.302234 ESTs 

A1769400 Hs.189729 ESTs 

AI022166 Hs.16188 ESTs 

T87693 Hs.204327 EST 

AA302862 Hs.90063 neurocaldn delte 

H66470 Hs.16004 ESTs 

T79763 Hs.14514 ESTs 

AI078554 Hs.15682 ESTs 

T91881 Hs.200597 KIAA0563 gene product 

R08665 Hs.17244 hypothetical protein aJ13605 

AB035335 Hs.144519 T-cell leukemia/lymphoma 6 

AL360143 Hs.17936 DKF2P434H1 32 protein 

T97307 gb7e53h05^1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done lMAGE:1214g7 3*. 

T99513 Hs.187447 ESTs 

AI631964 Hs.34447 ESTs 

R72137 Hs.7949 DKFZP58662420 protein 

AI369275 Hs.243010 Homo sapiens cDNA FU14445 fis. done HEMBB1001294. highly similar to GTP^INDING 

AA340111 Hs.100009 acyl-Coenzyme A oxidase 1. palmitoyi 

AF125044 Hs.19196 ubiquitin-conjugating enzyme HBUCE1 

R81733 Hs.33106 ESTs 

BE255499 Hs.3496 hypothetical protein MGC1 5749 

AA256444 Hs.126485 hypothetical proteh FU12G04: KIAA1692 protein 

W79283 Hs.35962 ESTs 

W92798 HS.2691B1 ESTs 

AW602S26 gb:RC5-BT0562-2601004)11-A02 BT0562 Homo sapiens cONA, mRNA sequence 

AI416988 Hs238272 inositol 1 ,4.5-triphosphate receptor, type 2 

6E548222 Hs.299883 hypothetkail protefai RJ23399 

AF052212 Hs.153934 oore^ndlng fector. mnt domain, alpha subunit 2; translocated to. 2 

AW069534 Hs^79583 CGk81 protein 

AA251380 Hs.10726 ESTs. Weakly sindlar to ALU1J1UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
AI479813 Hs^7841 1 NCK-assodated protein 1 
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114277 RC.240377„S 
(Celegans] 
114304 RC.240820 
114364 RC^241680 
132900 RC_AA005112 
129034 RCJ^005432 
131881 RC_AA010163 
452461 RC_AA026356 
114465 RC_AA026901 
131376 RC_AA036867 
101567 RC_AA044644 
431555 RG_AA046426 
132944 RC_AA054515 
114618 RC.AA084162 
130274 RC_AA085749 
110330 RC^AA098874 
114648 RC„AA101056 
IMAGE:548429 3' 
114658 RC_AA102746 
132456 RC_AA114250_s 
131319 RC_AA126561_s 
132225 RC_AA128980J 
IMAGE:567164 3' 
132669 RCJ\A129757 
114709 RC_AA129921 
131973 RCJ\A133331 
114750 RCJ\A135958 
115714 RC.AA136524_s 
114763 RC_AA147044 
114767 RC M148885 
114774 RC_AA150043 
129388 RC_M151621 
129183 RC_AA155743 
128869 RC_AA156335 
130207 RC^M156336 
114798 RC_AA159181 
114600 RC_AA159825 
[Celegans] 

114828 RC_AA234185 
114846 RC_AA234929 
114848 RC^AA234935 
114902 RC_AA236359 
132271 RC_AA236466 
114907 RC_AA236535 
135159 RC.AA236935^S 
132204 RCJ\A236942 
114928 RCJ\A237018 
132481 RC.AA237025 
114932 RC_AA242751 
314162 RCJ^42760 
131006 RC_AA242763 
114935 RC.AA242809 
WARNING 

132454 RC_AA243133 
437754 RC_AA243495 
114957 RC_M243706 
114974 RC_AA250848 
114977 RC_AA250868 
114995 RC_AA251152 
115005 RC^AA251544^s 
417177 RCJ^251792 
131889 RC^AA252063 
115026 RC^AA252144 
115045 RC_AA252524 
115068 RC^AA253461 
133138 RC_AA255522 
RECEPTOR. 
115114 RC.AA256488 
129584 RC_AA256528 
115137 RC_AA257976 
134312 RCJ\A258296 

115166 RC_AA258409 

115167 RCJ^5B421 
129807 RC.AA262077 
115239 RCJ\A278650 
115243 RCJ\A278766 



A1052229 Hs^73 ESTs» Weakly similar to T20410 hypothetical protein €02A10^ -Caenorhabditis elegans 

Af934204 Hs.16129 ESTs 

AL1 17427 Hs.172778 Homo sapiens mRNA; cONA DKFZp566P013 (from done DKI^66P013) 

AA777749 Hs.5978 LIM(k)mainonly7 

AA481157 Hs.108110 DKFZP547E2110 piotein 

AW361018 Hs.3383 upstream regulatory element binding protein 1 

N78223 Hs.108106 transcription factor 

BE621056 Hs.131731 hypothetical protein FU11099 

AK001644 HS56156 hypothetical protein RJ10782 

M33552 Hs.56729 lysosomal 

A1815470 Hs.260024 Cdo42 effector protein 3 

T96641 Hs.6127 Homo sapiens cDNA: FU23020 fis, clone i^G00943 

AW979261 Hs.291993 ESTs 

AA128376 Hs.153884 ATP binding protein associated with cell differentfation 

AI288666 Hs.16621 DKI^434I1 16 protein 

AA101056 gb'^25b03.s1 Stratagene neuroepithelium NT2RAM! 937234 Homo sapiens cONA done 

AA102383 iHs.249190 turnor necrosis factor receptor superfamily, member 10a 

AB011084 Hs.48924 KIAA0512 gene product; ALEX2 

NM_003155 Hs.25590 stanniocaldn 1 

AA128980 gbrzoOSal 1^1 Stratagene neuroepitheDum NT2RAMI 937234 Homo sapiens cDNA done 

W38586 Hs.293981 guanine nudeotide binding protein (G protein), gamma 3, linked 

AA397651 Hs.301959 proline synthetase co-transcribed (baderial homdog) 

AB018284 HS.1S8688 K1AA0741 gene ptodud 

AA887211 Hs.129467 ESTs 

T19226 Hs.172572 hypothetical protein FU20093 

AA810755 Hs.88977 hypothetical protein dJ511E16.2 

AI859B65 Hs.154443 ntinichromosome maintenance defident (S. cerevisiae) 4 

AV656017 Hs.184325 CGI-76 protein 

AA662477 Hs.1 10964 hypothetical protein FU23471 

6E561824 Hs273369 uncharacterized hematopolette stem/progenitor cells protein MDS027 

AA768242 Hs.80618 hypothetical protein 

AF044209 Hs. 144904 nudear receptor co-repressor 1 

AA159181 Hs.54900 serotogtcally defined coton cancer antigen 1 

219448 Hs.131887 ESTs. Weakly similar to T24396 hypofliefical protein T03F6^ - Caenorhabditis el^ans 

AA252937 Hs^83522 Homo sapiens mRNA; cDNA DKFZp434J1912 (from done DKFZp434J1912) 

BE018682 Hs.166196 ATPase, Class I, (yfw 8B, member 1 

BE614347 Hs.169615 hypothetical protein FU20989 

AW275480 Hs.39504 hypothetical protein MGC4308 

AB030034 Hs.1 15175 sterile-alpha motif and leudne zipper oontelning kinase AZK 

N29390 Hs.13804 hypothetical protein dJ4620232 

U43374 Hs.95631 Human nonnal keratinocyle mRNA 

AA235827 Hs42265 ESTs 

AA237018 Hs.94869 ESTs 

W93378 Hs.49814 ESTs 

AA971436 Hs.16218 K1AA0903 protein 

BE041820 Hs.38516 Homo saptens. done MGC:15887. mRNA. complete cds 

AF064104 Hs.221 16 CDC14 (cefl division cyde 14, S. cerevisiae) homoteg B 

H23329 H5.290880 ESTs. Weaklysimilarto AIU1_HUMAN ALU SUBFAfi/ULY J SEQUBICECONTAA/IINATION 

BE2S6227 Hs.250822 serine/threonine kinase 15 

R60366 Hs.5822 Homo sapiens cDNA: FU221 20 fis. done HEP18874 

AW170425 Hs.87680 ESTs 

AW966931 Hs.179662 nudeosome assembly protein 1 -like 1 

AW296978 Hs.87787 ESTs 

AA769266 Hs.193657 ESTs 

AI760825 Hs.1 11339 ESTs 

NM_004458 Hs.81452 fatty-add-Coenzyme A figase, tong-chain 4 

NM_002589 Hs.34073 BHi>rotocadherin (braln-hearQ 

AA251972 Hs.188718 ESTs 

AW014549 Hs.58373 ESTs 

AW512260 Hs.87767 ESTs 

AV657594 Hs.181161 Homo sapiens cDNA,FU14643 fis. done NT2RP2001597. weakly similar to RYANODINE 

AA527548 Hs7527 small fragment nudease 

AV656017 H5.184325 CG1>76 protein 

AW968304 HS.561S8 ESTs 

AB011151 H5334659 hypothetical proteki MGC14139 

AF095727 Hs.287832 myefin protein zero-fflce 1 

AA749209 Hs.43728 hypothetical protein 

Y1 1 192 Hs.5299 aktehyde dehydrogenase 5 fan%, member A1 (sucdnate-semiaklehyde dehydrogenase) 

BE251328 Hs73291 hypothetteal protein FU10881 

AA806800 Hs.116665 KIAA1842 protem 



91 

.02079492A2_I_> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AA286914 
AI831594 

AF069291 
AK001468 



AA1 48984 

D14540 

AA044840 

AF255910 

BE395161 

AI658580 

AA486620 

AW292809 . 

AB033035 

AA192669 

AI636361 

AI745379 

BE513442 

AA447709 

AA252468 

BE384932 

AA043429 



100850 RC_AA279667_s AA836472 

126884 RC_AA280791 U49436 

115322 RC AA280819 L08895 

133626 RC AA280828 AW836130 

115372 RC.AA282195 AW014385 

132825 RCJ^83127_s U82671 

130269 RCJVA284694 F05422 
129192 RCJ\A291137 
452598 RCJVA291708 
WARNING 

132131 RC_AA293495 
115536 RC.AA347193 
132411 RC.AA398474_s AA059412 

115575 RC_AA398512 AA393254 
115601 RC_AA400277 
WARNING 

103928 RC.AA400896 
125819 RC_AA404494 
115683 RC_AA410345 
115715 RC AA416733 
.132952 RC„AA425154 
115819 RC_AA426573 
132525 RC.AA431418 
115895 RC„AA435182 
132333 RC.AA437099 
115962 RC_AA446585 
115967 RC^AA446887 
115974 RC^AA447224 
115985 RC_AA447709 
129254 RC^AA453624 
133071 RC_AA455044 
116095 RC.AA456045 
122691 RC.AA460454_5 R19768 

116210 RCJ\A476494 BE622792 

116213 RCJ\A476738 AA292105 

134585 RC^AA481422 D14041 

134790 RCJU482269 BE002798 

116265 RC_AA482595 BE297412 
129334 RC AA485084^s AW1 57022 
116274 RC„AA485431.s AI129767 

303150 RC^AA489057 AA887146 

129945 RC.AA489638 BE514376 

116331 RC_AA491000 N41300 

116333 RC_AA491250 AF155827 

132994 RC^AA505133 AA112748 

134577 RC.AA598447 BE244323 

116391 RC_AA599243 T85558 
116394 RC„AA599574J NM^006033 

134531 RC_AA600153 AI742845 

116417 RCJ^609309 AW499664 

116429 RC_AA609710 AF191018 

116439 RC_.AA610068 AA251594 

116459 RC^AA621399 R80137 

427505 RC_AA621752 AA361562 

132699 RC_C21523 AW449822 

116541 RC.D12160 D12160 

132557 RC.D19708 AA114926 

112259 RC.D25801 AA337548 

116571 RC.D45652 D45652 
sequence. 

129815 RC_D60208J BE565817 

421919 RC.D80504,s AJ224901 

116643 RC.F03010 AI367044 

116661 RG_F04247 R61504 

repetitive 

116715 RC.F10966 AL1 17440 

116729 RC.F13700 BE549407 

318709 RC_H05063 R52576 

134760 RC^H16758 NIVL000121 

116773 RC.H17315.S AI823410 

106425 RC^H22556 H24201 

116780 RC^H22566 H22566 

131978 RC.H48459_s AA355925 

116819 RC_H53073 H53073 

111428 RC.H56559.S AL031428 

133175 RC.H57957.S AW955632 



Hs.297939 cathepan 8 

Hs.286236 KIAA1 856 protein 

Hs.78995 MADS box transcription enhancer fector 2, polypeptide C (myocyte enhancer factor 2C) 

Hs.75277 hypothetical protein FU13910 

Hs.88678 ESTs, Weakly similar to Unl(nown[Ksapiens] 

Hs.57698 EmplricaDy selected from AFFXsbigteprobeset 

Hs.168352 nudeoporin-iike protein 1 

Hs.183299 ESTs 

Hs.68647 ESTs, Weakly simOar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.40539 chromosome 8 open reading frame 1 

Hs.62180 anlUin (Drosophila Scraps homolog), actin binding protein 

Hs.47986 hypothetical protein MGC10940 

Hs.43619 ESTs 

Hs.48849 ESTs. Weakly simflar to ALU4.HUMAN ALU SUBFAMILY SB2 SEQUENCE CONTAMINATION 

Hs.199160 myetotd/tymphokJ or mixed-lineage leukemia (trithorax (Drosophila) homok)g) 

Hs.251871 CTP synthase 

Hs.54650 Junctional adhesion molecule 2 

Hs.1390 proteasome (prosome. macropain) subunit bete type, 2 

Hs.61426 Homo sapiens mesenchymal stem ceil protein DSC96 mRNA. partial cds 

Hs.41135 endomudn-2 

Hs.50727 N-acetytglucosaminldase. alpha- (Sanfilippo disease IIIB) 

Hs.51965 KIAA1209 protein 

Hs.45032 ESTs 

Hs.179520 hypothetical protein MGC10702 

Hs.42911 ESTs 

Hs.238944 hypothetical protein FU10631 

Hs.268115 ESTs. Weakly similar to T08599 probabte transcriptnn factor CA150 (H.saplens] 

Hs.1098 DKFZp434J1813 protein 

Hs.64313 ESTs. Weakly similar to AF257182 1 G-protein-coupled receptor 48 [M.sapiens] 

Hs.62618 ESTs 

Hs.172788 ALEX3 protein 

Hs.172788 ALEX3 protein 

Hs.326740 hypothetical protein I^C10947 

Hs^8573 H-2K bindir^ factor-2 

Hs.287850 integral membrane protein 1 

Hs.55189 hypothetical protein 

Hs.4947 hypotheticaJ protein FU22584 

Hs.182674 guanine nucleotide binding protein (G protein) alpha 12 

Hs.8217 stromal antigen 2 

Hs.165998 PAI-1 mRNA-binding protein 

Hs71616 Homo sapiens mRNA; cDNA DKFZp586N1720 (from ctone DKFZp686N1720) 

Hs.203963 iiypothetical protein FU10339 

HS279905 done HQ0310 PRO0310p1 

Hs.85951 exportin. tRNA (nuclear export receptor for tRNAs) 

Hs75113 general transcription factor IIIA 

Hs.65370 lipase, endothelial 

Hs.l 10713 DEK oncogene (DNA binding) 

Hs.12484 Human done 23826 mRNA sequence 

Hs.279923 putetive nucleotide binding protein. estradtoMnduoed 

Hs.43913 PIBFI gene product 

Hs.302738 Homo sapiens cDNA: FLJ21425 fis. done COL04162 

Hs.178761 26S proteasome-assodated padi homolog 

Hs.55200 ESTs 

Hs.249212 polymerase (RNA) 111 (DNA directed) (155kD) 

Hs.5122 ESTs 

Hs333402 hypothettoal protein MGC127e0 

gb:HUMGS02848 Human adult lung 3* directed Mboi cDNA Homo sapiens cONA 3*. mRNA 

Hs^6498 hypothetk:al protein FLJ21657 

Hs.109526 zinc finger protein 198 

Hs.153638 myetoid/lymphotd or mixed-lineage leukemia 2 

gbyh16a03^1 Scares infant brain 1NIB Homo saptens cDlSIA done 3' similar to oonteins Abi 

Hs.170263 tumor protein p53-binding protein, 1 

Hs.l 15823 ribonudease P, 40kD subunit 

Hs.285280 Homo saptens cONA: FU22096 fis. done HEP169S3 

Hs.89548 erythropoietin receptor 

Hs.169149 karyopherin alpha 1 (importin alpha 5) 

Hs.247423 addudn2(bete) 

Hs.30098 ESTs 

Hs.36232 K1AA0186 gene product 

Hs.9369d EST 

Hs.174174 KIAA0601 protein 

Hs£6666 ESTs, We^ similar to S19560 proline-rich protein MP4 • mouse [tilmusculus] 
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116844 RC_H64938 s 

116845 RC_H64973 
116892 RC.H69535 
116925 RC_H73110 
116981 RC_H81783 
131768 RC.H86259 
117031 RC_H88353 
contains LI 
117034 RC^H88639 
132542 RC_H88675 
134403 RC H93708 s 
117280 RC_N22107 
117344 RC_N24046 
117422 RC_N27028 
117475 RC_N30205 
117487 RC^N30621 
130207 RC^N33258 
117549 RC^N33390 
117683 RC_N40180 
IMAGE:276387 3' similar to 
117710 RC_N45198 
104514 RC»N45979.s 
117791 RC_N48325 
117822 RC.N48913 
129647 RC_N49394 
117895 RC_N50656 
(H.sapiens] 
131557 RC^N50721 

RC_N53143 
RC.N55326 
RC.tsl55493 



133057 
118103 
118111 
mRNA 
118129 RC_N57493 
IMAGE:277358 3'. mRNA 
118278 RC_N62955 
118329 RC_N63520 
3', mRNA 

118336 RC N63604 
132457 RC^N64166 

118363 RC_N64168 

118364 RC_N64191 
118475 RC_N66845 
sMarto 

118491 RC_N67135 
118500 RC_N67295 
101663 RCJ468399 
118584 RC_N68g63 
sequence 

421983 RC.N69331 
118661 RC_N70777 
118684 RC_N71364_s 

118689 RC.N71545 s 

118690 RC_N71571 
118766 RC_N74456 
118793 RC_N75594 
118817 RC_N79035 
118844 RC N80279 
118919 RC^N91797 
129558 RC„N92454 
132692 RC_N94581 
118996 RC_N94746 
119021 RC,N98238 
119039 RC^R02384 
119063 RC.R16833 
WARNING 

118523 RC_R41828 s 
119111 RC_R43203 
133970 RC.R46395 
119146 RC_R5e863 
120296 RC_R78248 
119239 RC.T11483 
sequence. 

119281 RC_T15896 
119298 RC_T23820 
126502 RC_T30222 
135073 RC.W15275^s 



H64938 Hs.337434 ESTs. Weakly similar to A46010 X-^inked retinopathy protein IH.sapiens] 

AA649530 gb:ns44ID5^1 NGIjCGAP^AIvl Homo sapiens cDNA done. mRNA sequence 

AI573283 Hs.38458 ESTs 

H73110 Hs^60603 ESTs, IVtodeiately similar to A47582B-cellgiDwth factor precursor fH.s^^ 

N29218 Hs.40290 ESTs 

AC005757 Hs.31809 hypotheUcal protein 

H88353 gb:yw21a02.s1 Morton Fetal Cochlea Homo sapiens cDNA done IMAGE'.252842 3* similar to 

U72209 Hs.180324 YY1 -associated factor 2 

AL137751 Hs.263671 Homo sapiens mRNA; cDNA DKFZp434l0812 (from done DKF2p434l0812): partial cds 

AA334551 Hs.a2767 spem) specific antigen 2 

M18217 Hs,172129 Homo sapiens cDNA: FLJ21409 fis, done COL03924 

R19085 Hs.210706 Homo sapiens cDNA FLJ13182 fis, done NT2RP3004070 

AI355562 Hs.43880 ESTs, Weakly similar to A46010 X-llnked retinopathy protein IH,sapiensl 

N30205 Hs.93740 ESTs. Weakly similar to 138022 hypothetwal protein [H^sapiens] 

N30621 Hs,44203 ESTs 

AF044209 Hs.144904 nuclear receptor co-repressor 1 

N33390 Hs.44483 EST 

N40180 gb:yy44d02.s1 Soares^mu!tiple_sderosis.2NbHMSP Homo sapiens cDNAdone 

N45198 Hs.47248 ESTs. Highly similar to similar to Cdc14B1 phosphatase tH.sapiens3 

AF164622 Hs.182982 golgin-67 

N48325 Hs.93956 EST 

AA706282 Hs.93963 ESTs 

AB018259 Hs.1 18140 KIAA0716 gene produd 

AW450348 Hs.93996 ESTs. Highly similar to SORU-HUMAN SORTIUN-RELATED RECEPTOR PRECURSOR 

AA317439 Hs.28707 signal sequence receptor, gamma (transtooon-associated protein gamma) 

AA465131 Hs.64001 Homo sapiens done 25218 mRNA sequence 

AA401733 Hs.184134 ESTs 

N55493 ' gb:yv50c02.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE246146 3', 

N57493 gb:yy54c08^1 Soares^multlple.sderosls„2NbHMSP Homo sapiens cDNA done 

N62955 Hs.316433 Homo sapiens cDNA FLJ1 1375 fis, done HEMBA100041 1 , weakly similar to ANKYRIN 

N63520 gb:yy62«)1.s1 Soares_muItiple_sderosis^2NbHMSP Homo sapiens cONA done IMAGE:278137 

BE327311 Hs.47166 HT021 

AB017365 Hs.173859 frizzled (Drosophila) homolog 7 

AI183838 Hs.48938 hypothetical protein FU21 802 

N461 14 Hs.29169 hypothetical protein FU22623 

N66845 gb:za46c1 1 .si Soares fetal liver spleen 1 NFLS Homo sapiens cDNA done iMAGE:295604 3' 

AV647908 Hs.90424 Homo sapiens cDNA: FU23285 fis, clone HEP09071 

W32889 Hs.154329 ESTs 

NM.003528 Hs^178 H2B histone tomity. member Q 

AW136928 gb:UI-H^)1-adpKf-08^l.s1 NCLCGAP.Sub3 Homo sapiens cDNA done 3', mRNA 

AI252640 Hs.110364 peptidylpfDtyljsomeraseC(cydophllinC) 

AL137554 Hs.49927 protein Wnase NYD-SP15 

N71313 Hs.163986 Homo sapiens cDNA: FU22765 fis, done KAIA1180 

AW390601 Hs.184544 Homo sapiens, done IMAGE:3355383. mRNA. partial cds 

N71571 Hs.269142 ESTs 

N74456 Hs.50499 EST 

N75594 Hs.285921 ESTs. IVIoderately similar to T47135 hypothetical protein DKFZp761L0812.1 [H^apiens] 

AI668658 Hs.50797 ESTs 

AL035364 Hs.50891 hypothetical protein 

AW452696 Hs.130760 myosin phosphatase, target subunit2 

AW580922 Hs.180446 karyopherin (importin) beta 1 

AW191962 Hs.249239 collagen, type Vlll, alpha 2 

N94746 Hs.274248 hypothetical protein FU20758 

N98238 Hs.55185 ESTs 

AI160570 Hs^52097 pregnancy specific bela-1 glycoprotein 6 

R16833 Hs.53106 ESTs. Moderately similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Y07759 Hs.170157 myosin VA (heavy polypeptide 1 2. myoxin) 

T02865 Hs.328321 EST 

AA214228 Hs.127751 hypothetical protein 

R58863 Hs.91815 ESTs 

AW995911 Hs^99883 hypolhefical protein FU23399 

T11483 gb:CHR90049 Chromosome 9 exon Homosapiens cDNA done 111-1 5' and 3', mRfMA 

AI692322 Hs.65373 ESTs. Weakly similar to T02345 hypothetfcal protein KIAA0324 [H.sapiensl 

NM^001241 Hs.1 55478 cydinT2 

T10077 Hs.13453 hypothetfcal protein fU14753 

W55956 Hs.94030 Homosapiens mRNA:<^NAOI^Zp586E1624<fromdone DKFZp586E1624) 
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119558 RC W38194 W38194 
132736 RC_W42414.s AW081863 
sapiens mad protein 
132173 RC_W46577_s 
134873 RC.W49632.S 
119650 RC.W57613 
119654 RC_W57759 
similar to 

119883 RC.W61118 
119894 RC_W65344 
119718 RC^W69216 
133010 RC_W69379 
119938 RC^WB672B 
120128 RC„Z38499 
120130 RC„Z38630 
120148 RC,Z39494 
120155 RC_Z39623 
131486 RC_Z40071.s 

120183 RC.Z40174 

120184 RC_Z40182 
120211 RC 240904 
120245 RC.AA166965 
120247 RC.AA167500 
120254 RC_AA169599_s W90403 

120259 RC.AA171724 AW014786 

120260 RC^AA171739 
120275 RC^AA177105 
120284 RC_AA182626 
to contains 

114056 RC_M186324 
129507 RC_AA192099 

120302 RC_AA192173 

120303 RC.AA192415 
120305 RC_AA192553 
120319 RCJUV194851 
133389 RC AA195520_s AA1 95764 
120326 RC^AA196300 AA196300 

RC^AA196517 
RC_AA196549 
RC AA196721 
RC_AA196729J 
RC.AA196979 
RC.AA206828 



X89426 
AA884471 
R82342 
W57759 

W65379 

AA041350 

W69216 

AI287518 

AW014862 

BE379320 

AA045767 

F02d06 

Z39623 

F06972 

AW082866 

Z40182 

Z40904 

AW959615 

AA167500 



AK000061 
AA177105 
AA179656 

AA188175 

AJ236885 

AA837098 

AI216292 

AW295096 

T57776 



X76040 

H94227 

AK000292 

N66397 

AA923278 

AA206828 



134272 
133145 
120327 
106686 
120328 
120340 
similar to 

134292 RC AA207123 AI906291 

RC_AA214539J AI380040 
RC_AA226914_s AA227068 

RC_AA227260 AF028706 

RC_AA227469 AA227469 
IMAGE:663732 3', mRNA sequence. 

120390 RC_AA233122 AA837093 
303876 RC JVA233334_s U64820 
dominant, ataxin 3) 

132038 RCJ\A233347 AI825842 

RCJU233519 T85825 

RC_AA233714 AA018515 

RCJ^A233796 AA134006 
RC_AA235050J AA235050 
gb±07077 

120414 RC_AA235704 AW137156 

RC.AA236031 AI128114 

RC_AA236352 AL133097 
RC^AA236390_s W94915 

RC AA236453 AA236453 

RC AA243370 AA243370 

RC.AA250947 AA250947 

RC_AA2510a3 AA251720 

RC AA251113 AA488750 

RC_AA251973 AA251973 

RC^AA252023 AI244901 

RC_AA252414 AA252414 

RCJ\A252650 AF006689 

RC_AA255523 AW952916 

RC AA258128 AI796395 

RCJ^262105 AA262105 

RCJUV262107 AI923511 



131522 
129051 
120375 
120376 



104463 
125750 
120396 
120409 



120420 
120422 
132221 
120423 
120435 
120453 
120455 
120456 
120473 
128922 
120477 
120479 
120488 
120510 
120527 
1^528 



Enq)irically selected from AFFX single probeset 

Hs288261 Homo sapiens cDNA: FU23037 fis. done LNG02036. highly simSar to HSU88019 Homo 

Hs.41718 endothelial cell-spediic molecule 1 

Hs.90449 Human done 23908 mRNA sequence 

Hs.79856 ESTs, Weakly similar to S65657 alpha-IC-adrenergic receptor spGoe (dm) 2 (H.sapiens] 

gb2d20g11.s1 Soares.fetaLbeaiLNbHH19W Homo sapiens cDNA done IMAGE:341252 3* 

Hs.57835 ESTs 

H5.57847 ESTs, Moderately similar to ICE4.HUMAN CASPASE-4 PRECURSOR [H^piens] 

Hs.92848 ESTs 

Hs.62669 Homo sapiens mRNA: CDNA DKFZp586D0923 (from done OKFZp586D0923) 

Hs.58885 ESTs 

Hs.91448 MKP-1 like protein tyrosine phosphatase 

Ks.5300 bladder cancer assodated protein 

Hs.65765 ESTs 

Hs.65783 ESTs 

Hs.27372 BMX non^oeptor tyrosine kinase 

Hs.65882 ESTs 

Hs.65885 EST 

Hs.66012 EST 

Hs.l 11045 ESTs 

Hs.103939 EST 

Hs.111054 ESTs 

Hs.192742 hypothetical protein FU12785 

Hs.101590 hypothetical protein 

Hs78457 solute carrier family 25 (mitochondrial carrier ornithine transporter) member 15 

gb2|^e1 1 ^1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 3" similar 

Hs.82506 KIAA1254 protein 

Hs.112180 zinc finger protein 146 (pHZ-52) 

Hs.269933 ESTs 

Hs.96184 ESTs 

Hs.101 337 uncoupling protein 3 (mitodiondrial. proton carrier) 

Hs.191094 ESTs 

Hs72639 ESTs 

Hs.21 145 hypothetical protein RG083M05.2 

Hs.278614 protease, serine. 15 

Hs.6592 Homo sapiens, done IMAGE:2961368, mRNA. partial cds 

Hs.278732 hypothetical protein FU202d5 
Hs.334825 Homo sapiens cDNA FU14752 fis. done NT2RP3003071 
Hs.290905 ESTs, Weakly similar to protease [H^apiens] 

gb:zq80b08^1 Stratagene hNT neuron (937233) Homo sapiens cDNA done 1MAGE£47895 3' 

Hs.81234 immunoglobulin superfamily, member 3 

Hs.239489 TfA1 cytotoxic granule-associated RNA-binding protein 

Hs. 1 08301 nudear receptor subfamily 2, group C. member 1 

Hs.1 1 1 227 Zic family member 3 (odd-paired Drosophila homoiog, heterotaxy 1) 

gb:zr18a07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 

HS.1114G0 cakaum/calmodulirvdependent protein kinase (CaM kinase) II delta 

Hs.66521 Machado-Joseph disease (spinocerebellar ataxia 3. ofivopontocerebeDar ataxia 3. autosomal 

Hs.3776 zinc finger protein 216 
Hs^46885 hypothefical proteh FU20783 

Hs.264482 Homo sapiens mRNA; cDNA DKFZp761 A041 1 (from done DKFZp761A041 1) 
Hs.79306 eukaryotte translatk)n Initiation lador 4E 

gb2s38e04.s1 Soares.NhHMPu.SI Homo sapiens cONA done IMAGE:687486 3* similarto 

Hs.181202 hypothetical protein FU10038 
Hs.1 12885 spinal cord-derived growth fedor-6 
Hs.301717 hypothetteal protein DKFZp434N1928 
Hs.42419 ESTs 

Hs.18978 Homo sapiens cDNA: FU22822 fis. done KAIA3968 
Hs.96450 EST 

Hs.170263 tumor protein p53-binding protein, 1 

Hs.104347 ESTs, Weakly similar to ALUC^HUMAN III! ALU CLASS C WARNING ENTRY 111 (H^piens] 

Hs.88414 BTB and CNC homology 1. basic leucine zipper transcription factor 2 

Hs.269988 ESTs 

Hs.9589 ubiquiTin 1 

Hs.43141 DKFZP727C091 protein 

Hs.1 10299 mitogen^ctivated protein kinase kinase 7 

Hs.63510 KIAA0141 gene produd 

Hs.1 11377 ESTs 

Hs.4094 Homo sapiens cONA FU14208 fis. done NT2RP3003264 
Hs.104413 ESTs 
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RC_AA2^5 

RC.AA278298 

RCJVA278529J 

RCJ\A278721 

RC_AA28003€ 

RCJ\A280646 

RCJ^A280738 

RCJ\A280794 

RC_AA280837 

RCJVA280886 



120529 
120541 
131445 
120544 
120562 
120569 
120571 
120572 
129434 
130529 
repetitive 
120575 RC_AA280934 
RC_AA2B1535 
RC_AA281797^S 
RC„AA282047 
RC_AA283002 
RC_AA283709 
RC_M283902 
RC^AA284108 
RC AA284109 
RC.AA284371 
RC_AA284744J 



132635 
120591 
120593 
430275 
117729 
120609 
132754 
130315 
132614 
447503 
cds 

135376 RC^AA284784 
120621 RC^AA284840 
RCJ\A286B44 
RCJ\A287032 
RCJVA287038 
RC.AA287546 
RC.AA287553_s 
RC_AA287556 
RC„AA287564 
RC_AA291015_s 
RC^AA291716 
RC_AA291749^s 
RC^AA293656 
RC.AA302430 
RC_AA302809 
RC_AA302820_S 
RC^AA310499 
RC_AA321890 
RC_AA340589 
RC_AA340622 
RC^AA342457J 
CONTAMINATION 
133439 RC_AA342828_s 
120793 RC^AA342864 
120798 RCJ\A342973 
120809 RCJ\A3464g5 
repeat mRNA sequence. 
132459 RC.AA347573 
120825 RC_AA347614 
120827 RC_AA347717 
120839 RC_AA348913 
repeat. mRNA sequence. 
120850 RC_AA349647 
RC_AA349773 
RC_AA350541^s 
RC.AA357159J 
RC_AA357172.i 
WARNING 

134637 RC_AA369856.s 
RC^AA370132 
RC_AA370472_s 
RC_AA370867 
RC_AA377296 
RC_AA383902 
WARNING 

120936 RC.AA385934 

120937 RCJ^386255 

120938 RC_AA386260 
129722 RC_AA386266 
120960 RCJ\A398014 
120985 RCJ\A398222 
120988 RCJ^A398235 



107868 
129868 
120644 
120660 
135370 
120661 
129116 
131567 
120699 
100690 
120726 
120737 
120745 
135192 
120750 
120761 
120768 
120769 
135232 



120852 
128852 
135240 
120870 



120894 
131854 
120897 
120915 
120935 



Ar434823 Hs.104415 ESTs 

W07318 Hs.240 M-phase phosphoprotein 1 

NIVL014264 Hs.172052 serine/threonine kniase 18 

BE548277 Hs.103104 ESTs 

BE244580 Hs.302267 hypothetical protein aJ10330 

AA807544 Hs.24970 ESTs. Weak^ stmllarto B34323 GTP-binding protein Rab2 {H^sapiens] 

AB037744 Hs^92 KtAA1323 protein 

H39599 Hs.294008 ESTs 

AWg67495 Hs.186644 ESTs 

AA178953 gbzp39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA done 3* similar to contains Aki 

AW978022 Hs.238911 hypothetical protein DKFZp762E151 1; KIAA1816 protein 

AB020686 Hs.54037 ectonucleotide pyrophosphatase/phosphodiesterase 4 (putative function) 

AF078847 Hs.1 91356 general transcription fiactor IIH. polypeptide 2 (44kD subunit) 

AA748355 Hs. 193522 ESTs 

Z1 1773 Hs.237786 zinc finger protein 187 

AA306166 Hs.7145 calpain7 

AW978721 IHs.266076 ESTs. Weakly simQar to A46010 X-iinked retinopathy protein [H^apiens] 

AI752244 Hs.75309 eukaiyotic translation elongation factor 2 

AI241084 Hs.1 54353 nonselective sodium potassium/proton exchanger 

AA284371 HsA 18064 similar to rat nudear ubiquitous casein kinase 2 

AA1 15496 Hs.336898 Homo sapiens. Similar to RIKEN cDNA 1810038N03 gene, done IMGC:9890. mRNA. complete 

BE617856 Hs.99756 mitochondrial ribosome recyding factor 

AW961294 Hs.143818 hypothetical protein FU23459 

AA266844 Hs.61260 hypothetkal protein FU1 3164 

AW172431 Hs.13012 ESTs 

Ai869129 Hs.96616 ESTs 

AA286785 Hs.99677 ESTs 

6E622187 Hs.99670 ESTs. Weakly similar to 138022 hypofiietical protein [H.sapiens] 

AA2875S6 Hs.263412 ESTs. Weakly similar to ALUB.HUMAN 1111 ALU CLASS B WARNING ENTRY !l! [H.sapiens] 

AB019494 Hs.225767 1DN3 protein 

AF015592 Hs.28853 CDC7 (cell division cyde 7. S. cerevisiae. homokigHike 1 

Ai683243 Hs.97258 ESTs. I^erateiy similar to S29539 ribosomal protein L13a. cytosolto {H^apiens] 

AA383256 Hs.1 657 estrogen receptor 1 

AA293655 Hs.97293 ESTs 

AL049176 Hs.82223 chordin-like 

AA302809 gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3' end. mRNA sequence. 

U83993 Hs.321709 purinergic receptor P2X. ligand-gated ion channel. 4 

Ai191410 Hs.96693 ESTs. Moderately similar to 2109260A B cell growth factor [H.sapiens] 

AA321890 HS.126S branched chain keto add dehydrogenase El, bete po^pepfide (mapte synip urtoe disease) 

AA340589 Hs.1 04560 EST 

AI769467 Hs.96769 ESTs 

AL038812 Hs.96800 ESTs. Moderately similar to ALU7_HU[^N ALU SUBFAMILY SO SEQUENCE 

Z23091 Hs.73734 glycoprotein V (platelet) 

AA342864 Hs.96812 ESTs 

AI247356 Hs.96820 ESTs 

AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end simPar to€ST containing 0 temily 

AL120071 Hs.48998 fibronedin leudne rich transmembrane protein 2 

AI280215 Hs.98885 ESTs 

AA382525 Hs.132967 Human EST done 122887 mariner transposon Hsmarl sequence 

AA348913 gb£ST55442 Intent adrenal gtend II Homo sapiens cDNA 3* end simiterto EST containing Atu 

AA349647 Hs.96927 Homo sapiens cDNA FU12573 fis. done NT2RM4000979 

AA349773 Hs.191564 ESTs 

R40622 Hs.106601 ESTs 

AA357159 Hs.96986 EST 

AA357172 Hs.292581 ESTs. Moderately simBar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

U87309 Hs.180941 vacuolar protein sorting 41 (yeast homotog) 

AA370132 Hs.97063 ESTs 

AF229839 Hs.173202 l-kappa-B4nterading Ras-like protein 1 

AA370867 Hs.97079 ESTs, Moderately similar to AF174605 1 F-box protein Fbx25 {H^apiensJ 

AL135556 Hs.97104 ESTs 

AL048409 Hs.97177 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA385934 Hs.97184 EST, Highly similar to (defUne not availabte 7499603) [Celegans] 

AA386255 Hs.97186 EST 

AA386260 Hs.104632 EST 

R20855 Hs.5422 glycoprotein M6B 

AA398014 Hs.104684 EST 

AI219896 Hs.97592 ESTs 

AA398235 Hs.97631 ESTs 
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121008 RC_AA3S8348 
GSSsandaCpG 
121029 RC_AA398482 
121032 RCJ^398504 
5 121033 RCJ^8505 

121034 RCJVA398507 

121035 RC_AA398^ 
121058 RC.AA398625 
121060 RCJ\A398632 

10 121081 RCJKA398633 

121091 RCJ\A398894 

CXDNTAMINATION 

121092- RC_AA398895 

121094 RC_AA39B900 
15 121096 RCJVA398904 

121115 RCJ\A399122 

121121 RC AA399371 

121122 RC_AA399373 
121125 RCJ^399441 

20 121151 RC_AA399636 

121153 RC.AA399640 

121163 RC_AA399680 

121176 RC_AA400080 

121192 RC.AA400262 
25 121223 RCJVA400725 

121227 RC_AA400748 

121231 RC.AA400780 

121278 RCJ^A401631 

121279 RC_AA40168B 
30 121282 RC_AA401695 

121299 RC.AA402227 

121301 RCJ^402329 

121302 RC.AA402398 
121304 RCJU402449 

35 121305 RC_AA402468 

134721 RC_AA403268_s AK000112 

121323 RC_AA403314 AA291411 

121324 RC_AA404229 
129047 RC AA404260 

40 131074 RC AA404271 

121344 RC_AA405026 

121348 RC_AA405182 

121350 RC_AA405237 

contains AIu 
45 121400 RC_AA406061 

121402 RC AA406063 

121403 RC_AA406070 
121408 RC_AA406137 
121431 RC^AA406335 

50 132936 RC_AA411801 

121471 RC_AA411804 

121474 RC.AA411833 
. 121526 RC.AA412219 

121530 RC.AA412259 
55 121558 RC.AA412497 

contains L1.t3 LI 

121559 RCJ^412498 

121584 RCJVA416586 

121609 RCJ^416867 
60 121612 RCJ^416874 

121737 RC.AA421133 

121740 RC_AA421138 

129194 RC_AA422079 

121784 RC.AA423B37 
65 121802 RCJVA424328 

121803 RC„AA424339 

135286 RC^AA424469_S AW023482 

121806 RC_AA424502 AA424313 

129517 RC_AA425004 
70 121845 RC.AA425734 

CONTAMINATION 

121853 RC.AA425887 

121891 RC_AA426456 

121895 RC„AA427396 
75 similar to contains 

121699 RCJU427555 R55341 



AA396348 Hs.301720 Human DMA sequence torn done RP1 1-251 J6 on chromosome 13 Contains EST s, STSs, 



AA398482 
AA393037 
AA398505 
AL389951 
AA398523 
AA398625 
AA398632 
AA393268 
AA398894 

AA398895 

AA402505 

AA398904 

AA398187 

AA399371 

A1126713 

AL042981 

AA399636 

AA399640 

AI676062 

AL121523 

AA400262 

A1002110 

AA400748 

AA814g48 

AA037121 

AA292873 

AA401695 

AA4022Z7 

NM_006202 

AA402587 

AA293863 

AA402468 



AA404229 

A1768623 

U16125 

AA405026 

AA405182 

AA405237 

AA406061 

AA406063 

AA406070 

AA406137 

AA035279 

AL120659 

AA411804 

AA402335 

AW665325 

AA778658 

AA412497 

Ai192044 
AI024471 
AA416867 
AA416874 
AA421133 
AA421138 
. AA150797 
T90789 
AI251870 
AI338371 



AW972853 
AI732692 

AA425887 
AA426456 
AA427396 



Hs.97641 EST 

Hs.161798 ESTs 

Hs.97360 ESTs 

Hs^1623 nudeoporinSOkD 

Hs.210579 ESTs 

Hs.97391 ESTs 

Hs.97395 ESTs 

Hs.97396 ESTs 

Hs.97657 ESTs. Moderately similar to ALU8.HUAAAN ALU SUBFAMILY SX SEQUENCE 

Hs.97658 EST 

gb2t62h10.r1 Soams.te5tis.NHT Homo sapiens cDNA done 5^. mRNA sequence 

Hs,332690 ESTs 

Hs.104682 ESTs. Weakly similar to mitochondrbl dtrate transport protein [Helens] 

Hs.189095 similar to SALL1 (sal (DrosophilaKiKe 

Hs.192233 ESTs. Highly similar to T00337 hypothetical protein K1AA0568 [H.saplens] 

Hs.251278 KiAA1201 protein 

Hs.143629 ESTs 

Hs.97694 ESTs 

Hs.1 11902 ESTs 

Hs.97774 ESTs 

Hs.190093 ESTs 

Hs.97169 ESTs. Weakly similar to dJ667H12.Z1 [H.sapiensl 

HS.97B23 Homo sapiens mRNA; cDNA DKFZp434l3024 (from done DKFZp434D024) 

Hs.96343 ESTs, Weakly similar to ALUC.HUMAN 1111 ALU CLASS 0 WARNING ENTRY HI [H.sapiens] 

Hs.98518 Homo sapiens cDNA FU1 1490 fis. done HEMBA1001918 

Hs.177996 ESTs 

Hs.97334 ESTs 

Hs.22826 tropomodulin 3 (ubiquitous) 

Hs.89901 phosphodiesterase 4A. cAMP-spedfic (dunce (DrosophilaHtomolog phosphodiesterase E2) 

Hs.325520 UT1-3TM protein 

Hs.97316 EST 

Hs.291557 ESTs 

Hs.89306 hypothetical protein FU20105 

Hs.97247 ESTs 

Hs.97842 EST 

Hs.108264 ESTs 

Hs.181581 glutamate receptor, ionotroptc. kainate 1 

Hs.193754 ESTs 

Hs.97973 ESTs 

gb:zt06e10.s1 NCLCGAP.GCB1 Homo sapiens cDNA done IMAGE:712362 3' simflar to 

Hs.98001 EST 

Hs.98003 ESTs 

Hs.98004 EST 

Hs.98019 EST 

Hs.176731 ESTs 

Hs.61 1 1 aryl-hydrocart)on receptor nudear translocator 2 

Hs.251575 ESTs 

Hs.188760 ESTs. Highly similar to Trad [H.sapiens] 

Hs.98120 ESTs 

Hs 98122 ESTs 

gb:ztg5g12.s1 Soares.testis.NHT Homo sapiens cDNA done IMAGE:730150 3* sbnltar to 

Hs.104778 ESTs 

Hs.98232 ESTs 

Hs.98185 EST 

Hs.98168 ESTs 

Hs. 1 04671 erythrocyte transmembrane protein 

Hs.98334 EST 

Hs.1 09276 latexin protein 

Hs.94308 RAB35. member RAS oncogene family 

Hs.188898 ESTs 

Hs.157173 ESTs 

Hs.97849 ESTs 

Hs.98402 ESTs 

Hs.112237 ESTs 

HS.1650S6 ESTs, Moderately simflar to ALU2.HUMAN ALU SUBFAMILY SB SEQUENCE 

Hs.98502 hypothetical protein FU14303 
Hs.98469 ESTs 

gb:zw33a02^1 Soares ovary tumor NbHOT Homo sapiens cONA done IMAGE:771O50 3* 
Hs.50421 iaAAQ203 gene product 
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121917 RC_AA428218 

121918 RCJ^428242 

121919 RCJ\A428281 

121941 RCJ^28865 

121942 RC.M428994 
121970 RC.M429666 
121993 RC_AA430181 
134660 RC_AA430184 s 
126753 RC_AA431288.s 

RC_M431293 
RC_AA431478 
RC„AA431492 
RC^AA431732 
RC_AA432278 
RC_AA434411 
RC_AA435512J 
122162 • RC_AA435698 
129406 RC.AA435711 

RC^AA435815 s 
RC.AA435842 
RC_AA436475 
RC_M436489 
RC_AA442060 
RC_AA442079 
RC.AA443151 
122382 RC_AA446133 
122425 RC_AA447145 
122431 RCJ^447398 
122450 RC_AA447643 
302653 RCJ\A447742_s 
122477 RC„AA448226 
122500 RC^AA448825 
RC_AA449444 
RC.AA450087 
RC_AA450211 
RC.AA450244 
RC_AA452123 
RCJ\A452155 
RC_AA452156 



122022 
122050 
122051 
122055 
122105 
122125 
135235 



318801 
122186 
122235 
129131 
134664 
122310 
122334 



122522 
122536 
122538 
122540 
122560 
421919 
122562 
mRNA 
122585 
122608 
122635 



RC_AA453036 
RC_AA453526 
RC_AA454086 

similar to 

122636 RC^AA454103 
122653 RC_AA454642 
122660 RC^AA454935 
122703 RCJ\A456323 
122724 RC_AA457395 
122749 RC_AA458850 
122772 RCJVA459662 
131098 RC_AA459668 
129045 RC_AA459679_s 
122777 RC_AA459702 
135362 RC_AA460017J 
122798 RC_AA460324 
122837 RC_AA461509 

122860 RC_M464414J 
mRNA sequence. 

122861 RCJ^464428 
122910 RCJ\A470084 
132899 RCJ\A476606 s 
122967 RC_AA478521 
129560 RC_AA478523 
123009 RCJ\A479949 
128917 RC^AA481252 
123081 RC_AA485351 
123133 RC^AA487264 
123184 RC.AA489072 
129671 RCJ^489630 

123233 HCJ^490225 
[H^piensl 

123234 RCJ\A490227 
123236 RCJ\A49Q255 
123255 RCJ\A490890 
129503 RCJ\A490916 s 



AA406397 Hs.98038 ESTs 

BE274689 Hs.184175 chromosome 2 open reading frame 3 

AA428281 Hs.g8560 EST 

AA428865 Hs.98563 ESTs 

AW452701 Hs.293237 ESTs 

AA429666 Hs.98617 EST 

AW297880 Hs.98661 ESTs 

U73524 Hs.87465 ATP/GTP-binding protein 

AA306478 Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

AA431293 HS.9B716 ESTs, Moderately similar to T42650 tiypothetical protein DKFZp434D0215.1 [H.sapiens] 

AI453076 Ks.166109 EUV (embryonk: lethal, almomial vision. DrosophilaHilffi 2 

AA431492 Hs.98742 EST 

AA431732 Hs.98747 EST 

AW2416B5 Hs.98699 ESTs 

AK000492 Hs.98806 hypothetical protein 

AW298244 Hs.293507 ESTs 

AA628233 Hs.79946 cytochrome P450, subfamily XIX (aromatization of androgens) 

AB018255 Hs.111138 KIAA0712 gene product 

U40763 Hs.77965 peptidyl-piolyl isomerase G (cydophiiin G) 

AA398811 Hs.104673 ESTs 

AA436475 Hs.1 12227 membrane-associated nucleic acid binding protein 

AB026436 Hs. 177534 dual specific]^ phosphatase 10 

AA256106 Hs.87507 ESTs 

AW192803 Hs.98974 ESTs. Weakly simBar to S65824 reverse transcriptase homolog [H.sapiensl 

BE465894 Hs.98365 ESTs, Weakly similar to LB4D_HUMAN NADP-OEPENDENT i^UKOTRJENE B4 12- 

AA446440 Hs.98643 ESTs 

AB007859 Hs.100955 KIAA0399 protein 

AA447398 Hs.99104 ESTs 

AA447643 Hs.1120g5 hypothetk^al protein DKFZp434F1 81 9 

AJ404468 Hs.284259 dynein. axonemal. heavy polypeptide 9 

AA448226 Hs.324123 ESTs 

AA448825 Hs.99190 ESTs 

AA299607 Hs.98g69 ESTs 

AF060877 Hs.99236 regulator of G-protein signalling 20 

AA450211 Hs.99239 ESTs 

AA476741 Hs.98279 ESTs. Weakly similar to A43932 mucin 2 precursor, intestinal IH.sapiens] 

AW392342 Hs.263077 centrosomal P4. 1-associated protein; uncharacterized bone marrow protein BM032 

AJ224901 Hs.109526 zinc finger protein 198 

AA452156 gb2x29c03.s1 Soares_totaLfetus_Nb2HF8.9w Homo sapiens cDNA done IMAGE:787876 3*. 

AI681654 Hs.170737 hypothetical protein FU23251 

AA453525 Hs.143077 ESTs 

AA454085 gbzx33a08^1 Soares_totaLfetus.Nb2HF8.9w Homo sapiens cDNA done IMAGE:788246 3* 

AW651706 Hs.99519 hypothetical protein FU14007 

AW009166 Hs.99376 ESTs 

A1816827 Hs.180069 nuclear respiratory factor 1 

AA456323 Hs.269369 ESTs 

AA4573gs Hs.99457 ESTs 

AA458850 Hs.293372 ESTs. Weakly similar to B34087 hypothetical protein IH^aptens] 

AW1 17452 Hs.99489 ESTs 

U66669 Hs.236642 3<hydroxyisobutyiyl-Coenzyme A hydrolase 

AI082883 Hs.30732 hypothetk:al protein FU13409; KIAA1711 protein 

AK001022 Hs.214397 hypothetical protein FU10160 similar to insulin reteted protein 2 

AA978128 Hs.99513 ESTs, Weakly similar to T17454 diaphanousHnelated fbnnin - mouse [M-muscuIus] 

AW366286 Hs.145696 splicing factor (CC1, 3) 

AA461509 Hs^565 ESTs. Weakly similar to putetive p150 [H.sapiensl 

AA464414 gb2x78gOU1 Soares ovary tumor NbHOT Homo saptens cDNA done IMAGE:809904 3^, 

AA335721 Hs.119394 ESTs 

AA470084 Hs.98358 ESTs 

AA476606 Hs.59666 SMAD in the antisense orientation 

AA806187 Hs.289101 ghicose regulated protein, 58kD 

AA317841 Hs7845 hypothetical proteh M6C2752 

AA535244 Hs.78305 RAB2, member RAS oncogene family 

AI365215 HS206097 oncogene TC21 

AI81 5486 Hs^43901 Homo sapiens cDNA FU20738 frs. done HEP08257 

AA487264 Hs.154974 Homo sapiens mRNA; cDNA DKRZp667N084 (from done DKFZp667N064) 

BE247767 Hs,18166 KIAA0870 protein 

NM-014700 Hs.1 190O4 KIAA0665 gene product 

AW974175 Hs.188751 ESTs. Weakly similar to MAPB.HUMAN MICROTUBULE-ASSOCIATEO PROTEIN IB 

NM_00193B Hs.16697 dowm-regulator of transcription 1 , TBP-binding (negative cofador 2) 

AW988504 Hs.123073 CDC2-ieiated pr(^ kinase 7 

AA830335 Hs.105Z73 ESTs 

AW768399 Hs.112157 €STs 
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131043 RC_AA490925 
123259 RC_AA490955 

[H.sap}ens] 

123284 RC AA495812 
123288 RCJVA495824 
123315 RC.AA496369 

to contains 

129179 RC_AA504125^s 
RCJ^521473 
RC AA598440 
RC AA598899J 
RC_AA599244 
RC AA599694_s 
RC AA600037 
RC_AA609135 
RC AA609582 
RC AA609684 
RC_AA609839 



131612 
123421 
123449 
129021 
132830 
123497 
123604 
129539 
123712 
123731 
simflarto 
130725 RCJ\A609862 
123800 RCJ!^620423 
123841 RC.AA620747 
123929 RC.AA621364 
123978 RC.C20653 
133184 RC.D2C085 
132835 RC.D20749 
132406 RC.D51285.S 
128695 RC„059972.I 
124028 RC.F04112J 
sequence. 

124057 RC_F13604 
RC_H01662 
RC_H05135J 
RC.H12245 
RC.H22842 
RC.H30894 
RC.H43442_s 
RC_H45996 
RC„H69281J 
RC_H69485J 
RC.H69899 



134899 
130973 
124106 
124136 
124165 
131229 
124178 
129948 
134374 
124254 
similar to 
129056 RC^H70627^s 
RC_H73050^s 
RC H73260 
RC_H77531_s 
RC_H80552 
RC^H80737^s 
RC.H93412 
RC.H9489^s 
RC.H95643_s 
RC.H96552 
RC_H97146 
RC^H99131.8 
RC^H99462_s 
RC_H99837_s 
RC_N22140 
RC„N22197 
RC.N23756_s 
RC^N24134 
RC„N24195 
RC^N26739 
RC.N27098 
RC_N27637 
RC_N33090 
RC_N35967 
RC_N38959J 
RC^N39069 
RC_N46441 
132338 RC.N48270J 
131403 RC.M8365_s 
RC.N51316 
RC^N51499.S 
RC.N53976 
RC_N54157 



100919 
130724 
100716 
124274 
129078 
124828 
124315 
100747 
124324 
452933 
132231 
129170 
133143 
132963 
135297 
134347 
130365 
421642 
439311 
124383 
124387 
129341 
129081 
102827 
124433 
124441 



124466 
132210 
124483 
124484 



124485 RC.N5430D 



AF084535 Hs.22464 epilepsy, progresshfe myodonts type 2, Ufora disease (lafbr^^ 

AI744152 Hs.283374 ESTs. Weakly simflar to CA15.HUMAN COLLAGEN ALPHA 1(V) CHAIN PRECURSOR 

AA488988 H&293796 ESTs 

AA495824 H5.188822 ESTs. Wealdy similar to A46010X-linked retinopathy protein [^^ 

AA4983e9 gb:zv37d10^1 Soares ovary tumor NbHOT Homo sapiens cDNA done 1MAGE:755827 3* 

AW969025 Hs.109154 ESTs 

AU076668 Hs.334884 SEC10 (S. cerevlsiaeHike 1 

AA598440 Hs.291 154 EST, Weakly simflar to 138022 hypothetical protein [H^piens] 

AL049325 Hs.1 12493 Homo sapiens mRNA; cDNA OKFZp564D036 (from done DKFZp564D036) 

AL044675 Hs.173081 KIAA0530 protein 

NM_014777 Hs.57730 KIAA0133 gene produd 

AA765256 Hs.135191 ESTs» Weakly Similar to unnamed protein product [H.saptens] 

AA609135 Hs^076 ESTs 

T47614 Hs.323022 ESTs, Highly similar to p60 katanin [Helens] 

AA609684 Hs.112748 Homo sapiens d3NA: FU21543 fis, done COL06171 

AA609839 gb:ae62f01 ^1 Stratagene lung carcinoma 937218 Homo sapiens cDNA done ll^6E:951481 3' 

T98807 Hs.80248 RNA-binding protein gene with multiple spDdng 

AA620423 Hs.1 12862 EST 

AA620747 Hs.1 12896 ESTs 

AA621364 Hs.1 12981 ESTs 

T89832 Hs.170278 ESTs 

AA001021 Hs.6685 thyrokl honnone receptor interador 8 

Z83844 Hs.5790 hypothetical protein dJ37E16.5 

AL133731 ' Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (from done DKFZp761C1712) 

NM.003478 Hs.101299 cuNinS 

F041 12 gb:HSC2JH062 nomnalized infant brain cDNA Homo sapiens cDNA done c-2^b6 3*. mRNA 

AA902384 Hs.73853 tjone morphogenette protein 2 

AI609045 Hs.321775 hypothetical protein DKFZp434D1428 

AI638418 Hs.78580 DEAD/H (Asp-GIUnAlaVVsp/Hts) box polypeptide 1 

H12245 gb7ml7a12.r1 Scares infant brain INIB Homo sapiens cDNA done 3', mRNA sequence 

H22842 Hs.101770 EST 

H30039 Hs.107674 ESTs 

N^/L015340 Hs.2450 leucyl-tRNA synthetase, mltochondriai 

6E463721 H5.97101 putative G protein-coupled receptor 

A1537162 Hs.263988 ESTs 

N22687 Hs.8236 ESTs 

H69899 gb7u70c12.s1 Weizmann Olfadory Ef^llum Homo sapiens cDNA done 1MAGE239158 3" 

AI76995d Hs.10d336 ESTs. Weakly similar to AWE JiVhAAN 1111 ALU CLASS E WARNING ENTRY 111 (H^apiens] 

X54534 Hs.27d994 Rhesus blood group. CcEe antigens 

AK001507 Hs.306084 Homo sapiens done FLB6914 PR01821 mRNA, complete cds 

X89687 Hs. 1 72350 HIR (histone cell cyde regulation defective, S. cerevisiae) homobg A 

H80552 Hs.102249 EST 

AI351010 Ks.102267 lysosomal 

AW9S2124 Hs.13094 presenillns assodated ihombokMike protein 

Nm.Q05402 HS.2887S7 v-ral simian leukemia viral oncogene homok)g A (ras related) 

X04588 Hs.85844 neurotrophk: tyrosine kinase, receptor, type 1 

H96552 Hs.159472 Homo sapiens cDNA: FU22224 fis. done HRC01703 

AW391423 Hs.266555 Homo saptens cONA: FLJ22425 fis, done HRC08666 

AA662910 Hs.42635 hypothetical protein DKFZp434K2435 

AW250360 Hs.109059 mitochondrial ribosomal protein L12 

AA094538 Hs.272808 putative transcription regulatnn nudear protein; K1AA1689 protein 

AA099693 Hs.34851 epsiion-tubunn 

AL1 18762 Hs.300206 Sec23-inteiBCting protein p125 

AF1 641 42 Hs.82042 solute earner family 23 (nudeobase transporters)^ member 1 

W561 19 Hs.1 55103 eukaryotic translation initiation factor 1 A, Y chromosome 

AF172066 Hs.106346 retinok; add repressible protein 

BE270668 Hs.151945 mitochondrial n*bosomal protein L43 

N27098 Hs.102463 EST 

N27637 Hs.109019 ESTs 

A1193519 Hs.226396 hypothetical protein FU11126 

AI364933 Hs. 168913 serine/threonine kinase 24 (Ste20, yeast homotog) 

BE244588 Hs.6456 chaperanin containing TCP1 . subunit 2 (beta) 

AA280319 Hs.288840 PR01 575 protein 

AW450481 Hs.161333 ESTs 

AA353868 Hs. 182982 go^tn-67 

AI473114 Hs.26455 ESTs 

R10084 Hs.1 13319 kinesin heavy chah member 2 

NI\A_007203 Hs.42322 A kinase (PRKA) anchor protein 2 

A1821780 Hs.179864 ESTs 

H66118 Hs^65520 ESTs. Weakly similar to 210926aA B cell grawrth factor [H.sapiens] 

AB040g33 Hs.15420 KIAA1500 protein 
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124494 RC_N54831 N54831 Hs.271381 ESTs. Weakly similar to 138022 hypothetical protein fH^iens] 

129200 RC_N59849 N69849 Hs.13565 Sam68-fike phosphotyrosine protein. T-STAR 

124527 RC_N62132 N79264 Hs^69104 ESTs 

124532 RC.N82375 N62375 Hs.102731 EST 

133213 RC_N63138 AA903424 Hs.6786 ESTs 

RC_N63172 D54120 Hs.148409 cell division cyde 42 (GTP-binding protein. 25kD) 

RC_N63772 AI301740 Hs.173381 dihydropyrimidinase-like 2 

RC„N63787 BE296313 Hs.265592 ESTs, Weakly sWarto 138022 hypothetical protein (H.saplensl 

RC.N68168 N68168 gb:za11cOU1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA ctone 3*. mRNA sequence 

RC_N68201 N68201 Hs.269124 ESTs, Weakly similar to 138022 hypothetfeal protein [H.sapiens] 

RC^N88300 N68300 gb.za12g07^1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA ckme IMAGE:292380 3*. 

Ne8321 Hs.231500 EST 

N69576 Hs.102788 ESTs 

AL133572 Hs.199009 protein containing CXXC domain 2 

AI680737 Hs^89068 Homo sapiens cDNA FU1 1918 fis, done HEMBB1000272 

T78277 Hs.100293 0-linked N-acety!glucosamine (GlcNAc) transferase (UDP-N-Gce1yIgluoosamine:polyDeptide-N- 

AW582962 Hs.102897 CGM7 protein 

W19407 Hs.3862 regulator of nonsense transcripts 2; DKreP434D222 protein 

AB002316 Hs.65746 KIAA0318 protein 

AK001357 Hs.102951 Homo sapiens cDNA FU10495 fis, done NT2RP2000297. moderately similar to ZINC FINGER 



124539 
133651 
129196 
124575 
124576 
124577 
mRNA 
124578 
124593 
128501 
105691 
128473 
128639 
124652 
133137 
124671 



RC_N68321 
RC^N69575 
RC.N75007 
RC„N75542 
RC^N90066 
RC^N91246 
RC_N92751 
RC^N93214_s 
RC_N99148 
PROTEIN 
133054 RC^R07876 
[Celegans] 

130410 RC_R10865J 
124720 RC.R11056 
similar to 

124722 RC.R11488 
129961 RC.R22947 
repetitive element 128944 
132965 RC R26589J 
133740 RC_R37588_s 
133074 RC R37613 
124757 RC R38398 
124762 RC_R39179J 
124773 RC R40923 
135256 RC_R41179 
131375 RC R41294_s 
133753 RC_R42307J 
128540 RC^R43189J 
124785 RC_R43306 

124792 RC_R44357 

124793 RC„R44519 
sequence. 

124799 RC_R45088 
sequence. 

124812 RC_R47948J 
124821 RC_R51524 
127274 RC_R54950 
124835 RC^R55241 
124845 RC_R59585 
124847 RC_R60044 
440630 RC_R60872 



AA464836 Hs.291079 ESTs. Weakly similar to T27173 hypothetical protein Y54G11 A.9 - Caenorhabditis eiegans 
J00077 Hs.155421 alpha-fetoprotein 

F^05283 gb:ye9lG08.s1 Soaies fetal liverspleen It^S Homo sapiens cONA done IMAGE:125102 3* 



T97733 
R23053 
RC_R23930_sAL137586 



Hs.185685 ESTs 



A1248173 

AW162919 

AL134275 

HI 1368 

AA553722 

R45154 

R41179 

AW293165 

NM.004427 

AW297929 

W38537 

R44357 

R44519 

R45088 

R47948 

H87832 

AW966158 

R55241 

R59585 

W07701 

BE561430 



Hs.191460 

Hs. 170160 

Hs.6434 

Hs.141055 

Hs.92096 

Hs.106604 

Hs.97393 

Hs.143134 

Hs.165263 

Hs.328317 

Hs.280740 

Hs.48712 



Hs.188732 

Hs.7388 

HS.S8582 

Hs.101214 

Hs.101255 

Hs.304177 

Hs.239388 



gb:yh31a05.r1 Soares placenta Nb2HP Homo saptens cDNA done 5* stoOar to contains LI 

Hs.52763 anaphase-promoting complex subunit 7 

hypothetical protein MQC12936 

RAB2. member RAS oncogene fiamily-igce 

hypothetical protein DKFZp761F2014 

Homo sapiens done 23758 mRNA sequence 

ESTs, Moderately simitar to A46010 X-linked retinopathy prote^ [H^sapiens] 
ESTs 

KtAA0328 protein 
ESTs 

early development regulator 2 (homolog of polyhomeofic 2) 
EST 

hypotheticat protein MGC3040 
hypothetical protein FU20736 

gb7g24h04.s1 Soares infant brain 1NIB Homo sapiens cDNA done IMA6E:33350 3\ mRNA 
gb:yg38g04.5l Soaies Infant biain 1N1B Homo sapiens cONA done IMAGE:34896 3'. mRNA 
ESTs 

keteh (Drosophila)>fike 3 

Homo sapiens cDNA FU12789 fis. done Nr2RP2001947 

EST 

ESTs 

Homo sapiens done FLB8503 PR02286 mRNA, complete cds 

Human DNA sequence from done RP1-304B14 on chromosome 6. Contains a gene for a novel 



protein and a part of a gene for a novel protein with two isofomis. Contains ESTs, STSs, GSSs and a CpG island 

124861 RC_R66690 R67567 Hs.1O7110 ESTs 

130141 RC.R67266_s NM_004455 Hs.150956 exostoses {multipleHike 1 

124879 RC_R73588 R73588 Hs.101533 ESTs 

124892 RC_R79403 AI970003 Hs.23756 hypothetical pR)tein similar to swineapylneuiamlnale lyase 

124906 RC_R87647 H75964 Hs.l07815 ESTs 

124922 RC^R93622 R93622 Hs.1 2163 eukaryotic translation initiation factor 2, subunit 2 (beta. 38kD ) 

124940 RC_R99599^s AF068846 Hs.103804 heterogeneous nuclear ribonudeoprotein U (scaffold attachment fector A) 

124941 RC.R99612 AI766661 Hs,27774 ESTs. Highly similar to AF161349 1 HSPC086 [H.sapiensl 

124943 RC J02888 AW963279 Hs.123373 ESTs, Weakly similar to ALUI^HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

T03170 Hs.100165 ESTs 

AW964237 Hs.6728 KIAA1 548 protein 

U55184 Hs.154145 hypothetical protein FU 11 585 

BE383768 Hs.65238 95 kOa retinoblastoma protein binding protein; KIAA0661 gene product 

R43504 Hs.6181 ESTs 

AW960782 Hs.6856 ash2 (absent, smaB. or homeoticDrosophila,homotog}4ike 

AI091121 Hs^46216 Homo sapiens cDNA: FU21781 fis. done HEP00223 

T40841 Hs.98681 ESTs 

BE313210 Hs^241 eukaiyotto translation etongatkxi factor 1 delta (guanine nudeoikle exchange protein) 

T50116 gb^bT/ciasI Stratageneovaiy (937217) Horno sapiens cONA done IMAGE:77202 3' sMar 



WARNING ENTRY [H.sapiens] 

124947 RC_T03170 

124954 RCJ10465 

132924 RC_T15418J 

133113 RC_T15597 f 

132975 RCJ-15650 

133235 RC_T1689B.s 

131082 RC_T2e644J 

124980 RCJ-40841 

124984 RC^T47566J 

124991 RC_T50116 



to similar to SP: VE22^LAMBD P03756 EA22 GENE , mRNA sequence. 
129475 RC_T50145.s NM.004477 Hs203772 FSHDregkmgenel 
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125000 RC.T58615 
132932 RC.T59940J 
129534 RC_T63595. 

125008 RC T64891 

125009 RC.T64924 
132940 RC_T64933_r 

125017 RC.T68875 
sequence. 

125018 RC_T69027 
125020 RC.T69924 
129891 RC.T70353 

RC^T79780_S 
RC^T79951 
RC^T80174.s 
RC.T80622 
RC_T85352 



T58615 

AW1 18826 

AKa02126 

T91251 

T64924 

T79138 

T68875 



Hs.1 10640 

Hs.6093 

Hs.11260 

Hs.303046 
Hs.127243 



ESTs 

Homo sapiens cONA: FU22783 fis, done KAIA1993 
hypotheGcal protein FIJ1 1264 

gb:yd60a10^1 Soaies fetal Ihrer spleen 1NFLS Homo sapiens cDtMA done 3^, mRNA sequence 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, partial cds 

gb7c30f05^1 Stratagene liver (937224) Homo sapiens cDNA done IIVIAGE:82209 3\ mRNA 



T69027 Hs.57475 sex comb on midteg homotog 1 

T69981 gb7c19d03.r1 Stratagene (ung (937210) Homo sapiens cONA done 5. mRNA sequence 

AI084813 Hs.13197 ESTs 
AI873257 Hs.7994 hypothetical proten FU20551 
AW970209 Hs.1 11805 ESTs 

T851 04 Hs.222779 ESTs. Moderately similar to similar to NEDD-4 [H^piens] 
T80622 Hs.268601 ESTs. Weakly simi^ to envelope [H.sapiens] 

T85352 gb:yd82d01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 1MAGE:114721 3* 

similar to contains Aiu repetitive element;contains LI repetitive element ;, mRNA sequence. 

125064 RC^T85373 T85373 gb:yd82f07^1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done iMAGE:1 14757 3" 

similar to contains Alu repetitive element;contains MER3 repetitive element mRNA sequence. 



134204 
125050 
125052 
125054 
125083 



125066 RC^T86284 T86284 
Alu repetitive element;, mRNA sequence 

112264 RC.T89579.S AL045364 HS.793S3 

125080 RC^Tg0360 T90360 Hs.268620 
WARNING ENTRY {^.sapiens] 

125097 RC.T94328J AW5763B9 Hs.335774 
125104 RC^T95590 T95590 



gb:yd77b07.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done 3* simOar to contains 
transcription fadorDp-1 

ESTs. Highly similar to ALU6_HUMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION 



EST, Moderately simitar to S65657 alpha-IC^dreneigic receptor splice forni 2 [H.sapiens] 
gb:ye40a03.s1 Scares fetal fiver spleen 1NFLS Homo sapiens cONA done 3* similar to 
gb|M10817|IGURRAA Iguana iguana 5S (rRNA );. mRNA sequence 
135107 RC.T97257J T97257 Hs.337531 ESTs. Moderately simBar to 138022 hypottietical protein [H.sapiens] 
129550 RC^T97599J AA845462 Hs.124024 deltex (Drosophiia) homotog 1 

1251 18 RC.T97620 R10606 gb:yf35f11.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 1MAGE:128877 3* 

similar to contains Alu repetitive element;, mRNA sequence. 



125120 RC.T97775 
134160 RC^T98152 
125138 RC_W31479 
125144 RC.W37999 
125150 RC_W38240 
104180 RC_W40150 
131987 RC_W45435 
125178 RC.W58202 
125180 RC W58344 
125182 RC W58650 
130588 RC^W68736 
125197 RC_W69106 
133497 RC_W69111 
100562 RC^W69385^s 
125639 RC.W69399^s 
129232 RC_W69459 
101495 RC.W72424 
125209 RC W72724 
125212 RC_W72834 
129132 RC_W73955 
125223 RC_W74701 



T97775 
T98152 
AW962364 
AB037742 
W38240 
AA247778 
AW453069 
W93127 
W58469 
AA451755 
AL030996 
AF086270 
BE617303 
NM«006185 
Z97630 
RgB881 
W72424 
W72724 
AA746225 
BE383436 
Ai916269 



WARNING ENTRY |H.sapiensl 



125225 
125228 
132393 
125238 
125247 
129296 
125263 
125266 



131601 
131677 
120837 
125277 
125278 
125280 
131856 
131844 
125284 
313447 
130799 



RC.W76540 
RC W79397 
RC^W85888 
RC W86038 
RC W86881 
RC.W87804 
RC_W88942 
RC.W90022 
PRECURSOR [H.sapiens] 
131321 RC.W92272 
RC W92764^s 
RC W93040 
RC_W93092 
RC„W93227 
RC W93523 
RC.W93659 
RC.W94003^s 
RC W9440LS 
RC.W94688 
RC.W94787.S 
RC Z38294.S 
12S289 RCZZ36311 
128874 RC.Z38465.S 



W74169 
AA033982 
AL135094 
N99713 
AA694191 
AI051967 
AA098878 
Wg0022 

U91543 

NM.007115 

H05317 

BE149656 

W93227 

A1218439 

A1123705 

Vyf93949 

AI419294 

NM.002666 

AW016321 

AB02894S 

T34530 

H06245 



Hs.100717 
Hs.79432 
Hs.129051 
Hs.24336 

Hs.1 19155 

Hs.3657 

Hs.31845 

Hs.103120 

Hs.263560 

Hs.16411 

Hs.278554 

Hs.74266 

Hs.301512 

Hs.226117 

Hs.109655 

H5.112405 

Hs.103174 

Hs.103173 

Hs.108847 

Hs.109057 

Hs.16492 
Hs.1 10059 
Hs.47334 
Hs.109514 
Hs.163914 
Hs.1 10122 

Hs.18680g 

Hs.25601 

Hs29352 

Hs.283549 

Hs.306621 

Hs.103245 

Hs.129998 

Hs.106932 

Hs.33245 

HS324342 

Ks.103253 

Hs.82308 

Hs.12696 

Hs.4210 

Hs.106801 



EST 

fibrillin 2 (congenital contradurai arachnodactyly) 
ESTs 

KIAA1321 protein 

Empirically seleded from AFFX sing te probeset 

Homo sapiens mRNA full length insert cDNA done EUROIMAGE 614975 

activity-dependent neuroprotective protein 

ESTs 

ESTs 

ESTs 

hypothetical protein LOC57187 

heterochromatin-ljke protein 1 

hypothetical protein MGC4251 

nudear mitotic apparatus protein 1 

HI histone family, member 0 

sex comb on midleg (Drosophila)-li)(e 1 

S100 catdum-binding protein A9 (calgnanuiin B) 

ESTs. Weakly similar to TSP2_HUMAN THROMBOSPONDIN 2 PRECURSOR IH.saplens] 
ESTs 

hypothetical protein MGC2749 

ESTs. Weakly similar to ALU5.HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 
DKFZP564G2022 protein 

ESTs, Weakly similar to 138022 hypothetteal protein [H.saplens] 

hypothetical protein FL114495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagene HeLa ceQ s3 937216 Homo sapiens cDNA done 5\ mRNA sequence 
ESTs, Highly similar to LCT2.HUMAN LEUKOCYTE GEa-DERIVED CHEMOTAXIN 2 

chromodomain helicase DNA binding protein 3 
tumor necrosis factor, alpha-induced protdn 6 
ESTs 

Homo sapiens cDNA FU1 1983 fis. done HEMBB1001051 
EST 

enhancer of potyoomb 1 
ESTs 

ESTs . 
ESTs 



destrin (actin depolymerizing factor) 

oortocQn SH3 domain-b'md'mg protein 

Homo sapiens cDNA FU13069 fis. done NT2RP3001752 

ESTs, Weakly similar to PC4259 fenifin associated protein [H.saplens] 
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130966 RC_238525.s 
128875 RC.Z38538J 
133200 RC_238551.s 
130158 RC^8783.s 
125295 RC_Z39113 
domain, (5emaphorin)4F 
125298 RC_Z39255J 
RC_Z39591 
RC.Z39783 s 
RC^239920 
RC.240166J 
RC.Z40388_S 
RC^Z40646 
RC_Z41697 
RC_Z99349 
RC_Z99394_s 
RC_AA027168 
D58024 s 
RC_AA251113 
RC^AA347573 
M31210 
C01527 
RC_N62948_s 
RC_W42674 
RC_W43000_s 
RC.N31750.S 
AF009301 
L37347 
M99564 
RC_M430673 
RC_AA251297 
RC_AA620722 
RC^W46577_s 
RC^W81237 
RCJ^020825 
PROTEIN TC10 
115243 RC_AA278766 
RC^H93708^s 
RC^N49394 
RC_H56559„s 
RC„AA446887 
RC.AA293656 
RC_AA251152 
RCJ\A233334_s 
dominant, ataxin 3) 
311463 RC.239920 

RC_AA192173 
RC.AA455044 
RCJ\A398504 
U41813 
RC_AA166965 
RC.AA398222 
RC^9095 
RC_AA284744J 



125300 
323122 
311463 
130882 
128888 
125310 
125315 
125317 
13509B 
104786 
132837 
120456 
132459 
101545 
133505 
132360 
132738 
119586 
129914 
130839 
132813 
134342 
131878 
105426 
132968 
132173 
113932 
114452 



134403 
129647 
111428 
115967 
120726 
114995 
303876 



120302 
133071 
121032 
129829 
120245 
120985 
114184 
447503 
cds 

132837 
121034 
119718 
120455 
125280 
132155 
120609 
121278 
109023 
129815 
108051 
113287 
114082 
116334 
131486 



RC_AA428201 
RC^AA398507 
RC_W69216 
RC_AA251083 
RC_W93659 
RC_AA227903 
RC_AA283902 
RC_AA401631 
RC_M157293 
RC_D60208_f 
RC_M043979 
RC_T66847 
RC_Z38239 
RCJ^M91457 
RC_Z40071^ 
107860 RCJkAQ24961 
131283 RCJ^443826 
132207 RCJ^A443294 
129183 RCJ^155743 
408431 RC,T23708 
120575 RCJ^A2B0934 



AW971018 Hs.21659 ESTs 

AB040923 Hs.106808 kelch (Drosoph!Ia)-like 1 

AB037715 Hs.183639 hypothetical protein FU 10210 

AB032947 Hs.151301 Ca2-Mlependent activator protein forsecretion 

AB022317 Hs.25887 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

AW972542 Hs.289008 Homo sapiens cDNA:FU21814 lis. done H£P01 068 

239591 Hs.101376 EST 

BE622770 Hs.264915 Homo sapiens cDNA FU12908 fiS. dbne NT2RP2004399 

R55344 Hs.22142 cytochrome b5 reductase bSR^ 

AA497044 Hs.20887 hypothetical protein FU10392 

AI760853 Hs.241558 aiiadne (Dfosophila) homc^ 2 

R59161 Hs.124953 ESTs 

R38110 Hs.105296 ESTs 

Z99348 Hs.112461 ESTs, Weakly simUar to 138022 hypothetical protein {H^iens] 

AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 

AA027167 Hs.10031 KIAA0955 protein 

AA370362 Hs.57958 EGF-TM7-Iatrophflin.related protein 

AA488750 Hs.88414 BTB and CNC homology 1 , basic leucine zipper transcription factor 2 

AL1 20071 Hs.48998 fibronectin leucine rich transmembrane protein 2 

BE246154 Hs.154210 endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 

AI630124 Hs.324504 Homo sapiens mRNA; cDNA DKFZp586J0720 (from done DKF2p586J0720) 

AW893660 Hs.46440 solute carter family 21 (organic anion transporter), member 3 

AK000738 Hs.264636 hypothetical protein FU20731 

AF088033 Hs.159225 ESTs 

NM.012421 Hs.13321 reananged L-myc fusion sequence 

AB011169 Hs.20141 similar to S.cerevisiaeSSIVW 

BE313625 Hs.57435 solute carter temlly 1 1 (proton-coupled divalent metal ion transporters), member 2 

NM_000275 Hs.82027 ocutocuteneous albinism II (pink-eye ditution (murine) homoiog) 

AA083764 Hs.6101 hypotheticat protein MGC3178 

W20027 Hs.23439 ESTs 

AF234532 Hs.61638 myosin X 

X89426 Hs.41716 endothelial cell-specHic molecule 1 

AA256444 Hs.126485 h]^o1hetica) protein FU12604;K1AA1692 protein 

AI369275 Hs.243010 Homo sapiens cDNA FU14445 fis. done HEMBB1001294. highly sbnliar to GTP-BINDING 

AA806600 Hs.1 16665 KIAA1842 protein 

AA334551 Hs.82767 spemn spedfic antigen 2 

AB018259 Hs.1 1 8140 KIAA0716 gene produd 

AL031428 Hs.174174 KIAA0601 protein 

AI745379 Hs.42911 ESTs 

AA293655 Hs.97293 ESTs 

AA769266 Hs.193657 ESTs 

U64820 Hs.66521 MachadOnloseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

R55344 Hs.22142 cytochrome b5 reductase b5R.2 

AA837098 Hs^69933 ESTs 

BE384932 Hs.64313 ESTs, Weakly similar to AF257182 1 G-protein-coupled receptor 48 rH.sapiens] 

AA393037 Hs.161796 ESTs 

AF010258 Hs.127428 homeobaxAQ 

AW959815 Hs.111045 ESTs 

A1219896 Hs.97592 ESTs 

R56434 Hs.21062 ESTs 

AA1 15496 Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, done MGCS890, mRNA. comptete 

AA370362 Hs.57958 EGF-TM74atrophain-related protein 

AL389951 Hs.271623 nudeoporin 50kD 

W69216 Hs.92848 ESTs 

AA251720 Hs.104347 ESTs, Weakly similar to ALUC.HUMAN llll ALU CUSS C WARNING ENTRY HI (H^sapiensl 

AI123705 Hs.106932 ESTs 

AK001607 Hs.41127 hypothetical protein FU1 3220 

AW978721 Hs,266076 ESTs, Weakly similar to A46010 X-linked retinopathy protein (H^aptens] 

AA037121 Hs.98518 Homo sapiens cDNA FU1 1490 fis, done HEMBA1001918 

AA157293 Hs.72168 ESTs 

BE565817 Hs.26498 hypothetical protein FU21657 

AA043979 Hs.62651 EST 

T66847 Hs,194040 ESTs. Weakly simflar to 138022 hypothetfcal protein {H-sapiaisJ 

AK001612 Hs^962 Homo sapiens cDNA FU10750 fis, done NT2RP3001929 

AL038450 Hs.48948 ESTs 

F06972 Hs.27372 BMX non>reoeptor tyrosine kinase 

AA024961 Hs^0730 ESTs 

AU077002 HS24950 regutetor of G-protein slgnaOing 5 

BE206939 Hs.42287 E2F transcriptkm factor 6 

BE561824 Hs.2733e9 uncharacteiized hematopoietic stem/progenHorceDsproteniMDS027 

AI338631 Hs.43266 Homo sapiens cDNA: FU22536 fis. done HRC131S5 

AW978(^ Hs.238911 hypoflietical protein DKFZp762E1511: KIAA1816 protein 
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132121 RCJVA443284^s 
117657 RC.N39074 
134922 RC W04507^s 
118523 RC^R41828_s 
116845 RC_H64973 
115291 RC_AA279943 
120326 RC.AA196300 
130174 M29550 
129131 RCJVA436489 
129868 RCJVA287032 
118661 RC_N70777 
129829 RC.AA496921 
115985 RCJUV447709 
•134637 RCJ\A369856.s 
132714 RCJ^2598 
129771 RC_H73237 
123360 RC_AA504784 
132902 RC.AA490969 
113716 RC.T97750 
113825 RC.W48860 
130367 RC.Z38501 
120541 RC^AA278298 
116727 RC.F13684 
118219 RC.N62231 
119767 RC W72562 
128917 RC_AA481252 
451553 RC.AA020928 
132716 RC_AA251288 
118525 RC.N67861 
114618 RC.AA084162 
119743 RC_W70242 
108154 RC^AA425151^s 
122798 RCJ^460324 
133746 U44378 
119822 RC.W74471 
122186 RC_AA435842 
114941 RC.AA243017 
118053 RC_N53367 
123234 RC.AA490227 
129280 M63154 
118995 RC.N94591 
116750 RC_H05960 
129026 Mg8833 
105127 RC_AA158132 
114513 RC_AA044825 
411856 RC_T35697 
132036 W01568 
130091 RC_W88999 



NWL004529 

N39074 

AI718295 

Y07759 

AA649530 

BE545072 

AA196300 

M29551 

AB026436 

AW172431 

AL137554 

AF010258 

AA447709 

U87309 

W39388 

AL096748 

AA532718 

Aig36442 

AA001356 

AW014486 

AL135301 

W07318 

R76472 

AA862391 

W72562 

AI365215 

AA018454 

BE379595 

N67861 

AW979261 

AA947552 

NIVL005754 

AW366286 

AW410035 

AF086409 

AA3g8811 

AA236512 

N53391 

NM.001938 

M63154 

N94591 

AA760689 

AL120297 

AA045648 

AA044873 

H67899 

AL1 57433 

W88999 



sequence 
414108 U09564 
119881 RC.W81456 
117770 RC^N47953 
119850 RC„W80447 
115439 RC_AA284561 
123107 RC„AA486071 
406698 M24364 
121231 RC_AA400780 
132074 AB002366 
413670 AB000115 
125277 RC.W93227 
114056 RCJ\A186324 
121153 RCJ^99640 
121609 RCJVA416867 
120661 RCJ^A287556 
120850 RC.AA349647 
124947 RC.T03170 
130529 RC_AA280886 
repetitive element. mRNA sequence 
117683 RC_N40180 N40180 
IMAGE:276387 3' similar to contains Ll, 
120745 RCJ^302809 AA302809 
120936 RCJVA385934 
112597 RC.R78376 
120183 RC^40174 
120644 RCJ^287038 



AI267592 

W81486 

AW957372 

AI247568 

AI567972 

AA225048 

X03068 

AA814948 

AA478486 

AB000115 

W93227 

AA188175 

AA399640 

AA416667 

AA287556 

AA349647 

T03170 

AA178953 



AA385934 
R78376 
AW082866 
AI869129 



Hs.404 ' myelokj/lymphoid or mbced-lineage leukemia (trithorax (DrosophOa) homolog); transtocated to. 3 

Hs.44933 ESTs 

Hs.91161 prefoldin4 

Hs.170157 myosin VA (heavy polypeptide 1 2. myoxin) 

gb:ns44fl)5.s1 NC(_CGAP_A{v1 Homo sapiens gONA done, mRNA sequenoe 

Hs.122579 hypothetical protein FU10461 

Hs.21145 hypothetical protein RG083M05^ 

Hs.151531 protein phosphatase 3 (formerly 28). catatytic subunit. beta isofbnn (caldneurin A beta) 

Hs.177534 dual speaTtdty phosphatase 10 

Hs.13012 ESTs 

Hs.49927 protein kinase NYD^PIS 

Hs.127428 homeoboxA9 

Hs.2681 15 ESTs. Weakly simflar to T08599 probable transcription factor CA150 (H^ens] 

H&180941 vacuolar protein sorting 41 (yeast homolog) 

Hs.55336 Homo sapiens, ctone MGC:17421 . mRNA, complete cds 

Hs.102708 DKFZP434A043 protein 

Hs.178604 ESTs 

Hs.59838 hypothetical protein FU10808 

Hs.18159 ESTs 

Hs.22509 ESTs 

Hs.8768 hypothetk^ protein FU10849 

Hs.240 M-phase phosphoprotein 1 

Hs,65646 ESTs 

Hs.48494 ESTs. Moderately similar to A46010 X4lnked retinopathy protein [H^ens] 

Hs.58119 ESTs 

Hs.206097 oncogene TC21 

Hs.269211 ESTs 

Hs.283738 casein kinase 1. alpha 1 

Hs.49390 ESTs 

Hs.291993 ESTs 

Hs.58086 ESTs 

Hs.220689 Ras-GTPase-activating protein SH3-domain-binding protein 

Hs.145696 splicing factor (CC1 .3) 

Hs.75862 MAD (mothers against decapentapleglc. Drosophila) homolog 4 

Hs.301327 ESTs 

Hs.104673 ESTs 
Hs.87331 ESTs 
Hs.47629 ESTs 

Hs.16697 down-regulator Of transcription I.TBP-binding (negative oofactor 2) 

Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 
Hs.323056 ESTs 
Hs.92418 ESTs 

Hs.108043 Friend leukemia vims integration 1 

Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 

Hs.103446 ESTs 

Hs.4190 Homo sapiens cDNA: FU23269 fis. done 00109533 
Hs^7706 hypothetical protefri DKFZp434E2220 

gb:zh70h03.s1 Scares JetaLliver_spleen_1NFLS_S1 Homo sapiens cDUA done 3', mRNA 

Hs.75761 SFRS protein kinase 1 
Hs.58648 ESTs 

Hs.46791 ESTs. Weakly similar to 138022 hypothetrcal protein {H.sapiens] 
Hs.58452 ESTs 

Hs.193090 ESTs, Highly similar to AF161437 1 HSPC319[H^apiens] 
Hs.104207 ESTs 

Hs.73931 major histocompatibility complex, class II. DQ beta 1 

Hs.96343 ESTs. Weakly similar to ALUC J1UMAN IHI ALU CLASS C WARNING ENTRY III [H^piens] 

Hs.3852 KIAA0368 protein 

Hs.75470 hypothetkal protein, expressed in osteoblast 

Hs.103245 EST 

H5.82506 K1AA1254 protein 

Hs.97694 ESTs 

Hs.98185 EST 

Hs.263412 ESTs. Weakly simflar to ALUB.HUMAN fill ALU CLASS B WARNING ENTRY III [H^piens] 
Hs.96927 Homo sapiens cDNA FU12573 fis. done NT2RM4000979 
Hs.100165 ESTs 

gb:zp39e03.s1 Stratagene musde 937209 Homo sapiens cDNA done 3' similar to contains Atu 

gb7y44d02^1 Soares_multiple_sderosls_2NbHMSP Homo sapiens cDNA done 
t1 LI repetiSve element mRNA sequence. 

gb£ST10426 Adipose tissue, white I Homo sapiens cONA 3' end. mRNA sequence. 
Hs.97184 EST. Highly similar to (defline not available 7499803) [Celegans] 
Hs^733 EST 
Hs.65882 ESTs 
Hs.96616 ESTs 
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119023 RC_N98488 N98488 
IMAGE:310129 3'. mRNA sequence. 

107582 RC„AA002147 AA002147 

118249 RC_N62580 N62580 

116022 RC_AA252029 AA252029 

117710 RC_N45198 N45198 

115341 RC.AA281452 AA281452 

118896 RC_N90680 N46213 

121121 RC_AA399371 AA399371 

118329 RC.N63520 N83520 
3', mRNA sequence. 

119496 RC_W35416 W35416 

118111 RC^N55493 N55493 
mRNA sequence, 

119062 RC_R16698 AW444881 

116710 RC_F10577J F10577 

119261 RC^T15956 T15956 

122723 RC^AA457380 AA457380 
similar to contains LI .63 LI repetitive element 

117732 RC_N46452 N46452 



gb:zb82h0U1 Soaies.senescenUbroblasts.NbHSF Homo sapiens cDNA done 
Hs.59952 EST 

Hs.322925 EST, Weakly simUar to putative p150 [H.sapiens] 
Hs.87935 ESTs 

Hs.47248 ESTs. H'^hiy similar to similar to Cdc14Bl phosphatase [H.sapiens] 
Hs.88840 EST. Weakly similar to granule cell maiker protein [M.miJSGutus] 
Hs.54642 methionine adenosyltransferase II, beta 
HS.18909S similar to SAai (sal (DrosophilaHike 

gb:yy62l01.s1 Soares^multiple_sclerosis^NbHMSP Homo sapiens cDt^ done IMAGE:278137 

Hs.156861 ESTs. Moderately similar to A46010 X-ltnked retinopathy protein [H.sapiens] 

gb:yvS0c0Zs1 Soaies fetal liver spleen 1NFLS Homo sapiens cDI^ done IMAGE:246146 3', 

ESTs 



HS77829 
Hs.306088 
Hs.65289 



IMAGE:279521 3' similar to contains LI .t2 LI repetitive element mRNA sequence. 



v-crk avian sarcoma virus CT10 oncogene homolog 
EST 

gb:aa86b10.s1 Stiatagene fetal retina 937202 Homo sapiens cDNA done iMAGE:838171 3' 
mRNA sequence. 

gb:yy76h09^1 Soares.multiple.sderosisJ!NbHMSP Homo sapfens cONA done 



104787 RC^AA027317 AA027317 
similar to contains Alu repetitive element; 



100071 
115819 
130882 
125225 
108339 



A28102 
RC^AA426573 
RC^Z40166J 
RC^W76540 
RC_AA070801 



WARNING ENTRY [H.sapiensl 



A28102 
AA486620 
AA497044 
W74169 
AW151340 



100338 D63483 
121636 RCJ^417027 
103875 RC.AA418387 
118716 RC^N73460 
119763 RC.W72450 
121917 RC^AA428218 
132806 M91488 
130949 Y10659 
108806 RC.AA129933 
133276 RC AA490478 
134760 RC^H16758 
132867 AA121287 
132051 AA091284 
114208 RC^Z39301 
104094 AA418187 
128718 AA426361 
302032 RC^N20407 
115501 RC_AA291553 
101997 U01160 
103708 AA037206 
101899 S59184 
115839 RCJ^29038 
409459 D50678 
103563 Z22534 
123233 RC.AA490225 
[H^piens] 

121305 RC AA402468 
114798 RC_AA159181 
RC_AA196549 



133145 
131567 
112300 
129507 
121033 
121151 
121402 
123203 
132271 
125197 
114935 



086864 

AA379203 

T26379 

AI658908 

R54146 

AA406397 

AI699432 

AV656840 

AF070578 

AW978439 

NIVL000121 

AF226667 

AA393968 

AL049466 

AA418187 

NM_002959 

NM_001992 

AA291553 

AU076536 

AA430591 

S59184 

BE300266 

D86407 

L02911 

AW974175 

AA402468 
AA159181 
H94227 



RC^AA291015.s AF015592 



RC^R54554 

RC_AA192099 

RC^AA398505 

RC.AA399636 

RC_AA406063 

RC_AA489671 

RC_AA236466 

RC.W69106 

RC_AA242809 



H24334 

AJ236885 

AA398505 

AA399636 

AA406063 

AA352335 

AB030034 

AF086270 

H23329 



WARNING ENTRY [H-sapiens] 



125279 RC_W93640 

108778 RC AA128548 

108087 RCJ\A045709 

132466 RC.N66810.S 

133328 R38553 

124057 RC.F13604 

124800 RC.R45115 



AW401809 
AF133123 
AA045708 
AI597655 
AW452738 
AA902384 
AW864086 



gb:ze97d11.s1 SoaiesJetaLhearLNbHHISW Homo sapiens cDI^ done ll\AAGE:366933 3* 
mRNA sequence. 

Human GABAa receptor al|^-3 subunit 

Hs.41135 endomucin-2 

Hs.20887 hypothetical protein FLJ10392 

Hs.16492 DKFZP564G2022 protein 

Hs.51616 ESTs, WeaWy similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.57735 acetyl LDL receptor; SREC 

Hs.306654 Homo sapiens cDNA FLJ13574 fis, done PLACE1008625 

Hs.48802 Homo sapiens done 23632 mRNA sequence 

Hs.1 18722 fucosyttransfeiase 8 (alpha (1 .6) fucosyltransferase) 

Hs.10450 Homo sapiens cONA: FU22063 fis. done HEP10326 

Hs.98038 ESTs 

Hs.278619 hypothetical protein FU10099 

Hs.285115 interleukin 13 receptor, alpha 1 

Hs.71 1 68 HonfK) sapiens done 24674 mRNA sequence 

Hs.69504 ESTs 

Hs.89548 erythropoietin receptor 

Hs.58553 CTP synthase II 

Hs.180145 HSPC030 protein 

Hs7859 ESTs 

Hs.330515 ESTs 

Hs.281706 sortnin 1 

Hs. 1 28087 coagulation factor II (thrombin) receptor 

Hs.190086 ESTs 

Hs.50984 sarcoma amplified sequence 

Hs.72071 hypothetical protein FU20038 

Hs.79350 RYK receptor-like tyrosine kinase 

Hs.28935 ttansdudn-like enhancer of split 1 , homolog of Drosophila E(sp1 ) 

Hs.54481 tow density Gpoprotein receptor-related protein 8. apolipoprotein e receptor 

Hs.150402 Adivin A receptor, ^e I (ACVR1) (ALK-2) 

Hs.188751 ESTs. Weakly sim^ar to MAPB.HUMAN MICROTUBULE-ASSOCIATED PROTEIN IB 

Hs.291557 ESTs 

Hs.54900 serologically defined ooton cancer antigen 1 

Hs.6592 Homo sapiens, done IMAGE:2961368, mRNA, partial ods 

Hs.28853 CDC7 (cell division cyde 7. S. cerevisiae. homolog)-ia(e 1 

Hs.26125 ESTs 

Hs.112180 ^c finger protein 148 {pHZ-52) 

Hs.97350 ESTs 

Hs.143629 ESTs 

Hs.98003 ESTs 

Hs.65641 hypothetical protein FLJ20073 

Hs. 115175 steriie-xilpha motif and ieudne zipper confeining kinase AZK 

Hs.278554 heterochromatin-fike protein 1 

Hs^90880 ESTs. WeaWy simOarto ALUI^HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.4779 K1AA1150 protein 

Hs.90847 general transcription fector IIIC, polypeptide 3 (102kD) 

Hs.40545 ESTs 

Hs.49265 ESTs 

HS265327 hypothetk^al protein 01^7611141 

Hs.73853 bone morphogenetic protein 2 

Hs.138617 thyrokS homione receptor inteiactor 12 
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121029 RCJ\A398482 AA398482 

120663 RC AA287627 AA827798 

102133 U15173 AU076845 

108246 RC AA062855 AW23132 

125226 RC^W78134 AA782536 

120260 RCJ^171739 AK000061 

124906 RC^R87647 H75984 

109406 RC_AA226877 AA199883 

109271 RCJVA195668 AW137422 

125052 RC_T80174„s T85104 

109101 RC AA167708 AW60893D 

115241 RC_AA278723 AA648278 

117163 RC^H97909 N36861 

113530 RC.T90313 T90313 

120375 RCJ^7260 AF028706 

129435 AA314256 AF151852 

114864 RC AA235256 AA135332 

103988 AA314389 AA314389 

131006 RCJ^42763 AF054104 

106781 RC_AA478474 AA330310 

106141 RC_AA424558 AF031463 

116213 RC_AA476738 AA292105 

135266 AB002326 R41179 

135058 RCJ^430152 AI379720 

119908 RC^W85844 AA524470 

103695 AA018758 AW207152 

103978 AA307443 NM_016940 
109485 RC_AA233472 BE619092 

129574 AA458603 AA026815 
115347 RC_AA281528 AA356792 
120765 RC_AA338735 AW961026 
WARNING ENTRY [H.saplensl 
121059 RC_AA398628 AA393283 
131887 AA046548 W17064 
member 1 

112064 RC^R43812 AL049390 
115606 RC_AA400465 AI025829 
131750 RC H94855_s NN/L004349 
102123 U14518 NM_001809 
129847 RC.W46767 N64025 
133809 RC.AA235275 AV649326 
132210 RC.N51499.S NM.007203 
122356 RCJ\A443794 AA443794 
114958 RCJ\A243708 N20912 
103951 AA287840 AL353944 
134703 RC.AA280704 AF117065 
128727 AA287864 AI223335 
105743 RC_AA293300_s BE246502 
domain, (semaphorin) 48 



103744 AA076003 
sequence 
114348 N80402 
114009 RC_W90067 
134704 RC_AA280849 
128629 AA399187 
104410 H65925 
110200 RC.H21075 
124483 RC.N53976 
101391 M14648 
109657 RC_R)4826 
117140 RC^H96813 
132937 RCJ^A233706. 
129799 R36410 
105077 RC.AA142919 
100850 RC.N58561.S 
131043 RC.AA4g0925 
118417 RC_N6e048_f 
129254 RCJ\A243695 
119149 RC.R58910 
133998 AA091367 
110223 RC_H23747 
117626 RC^N36090 
135286 RC_AA424469, 
122967 RCJ\A478521 
131236 AA282640 
128568 AA463380 



Hs.97641 

Hs. 105089 

Hs.155596 

Hs.146343 

Hs.122647 

Hs.101590 

Hs.107815 

Hs.67624 

Hs.86022 

Hs,222779 

Hs,52184 

H5.193859 

Hs.42344 

Hs.16732 

Hs.1 11227 

Hs.1 11449 

Hs.71608 

Hs.42500 

Hs^116 

Hs.24181 

Hs.9302 

Hs.326740 

Hs.97393 

Hs.93814 

Hs.58753 

Hs.186600 

Hs.34136 

Hs.28465 

Hs.1 1463 

Hs.334824 

Hs.96752 



Hs.332848 

Hs.22689 

Hs.86320 

Hs.31551 

Hs.1594 

Hs.296178 

HS76359 

Hs.42322 

Hs.98390 

Hs.42369 

Hs.50115 

Hs.88764 

Hs.50651 

Hs.9598 



AA079267 

AL050321 

At248544 

AA837124 

AL095748 

AI807519 

H21075 

AI821780 

NWL002210 

R60900 

H96813 

f AW952912 
AW967473 
W55946 
AA836472 
AF084535 
AF080229 
AA252468 
BE304701 
AA380267 
H19836 
AK001757 

.8 AW023482 
AA806187 
AF043117 
H12912 



EST 
ESTs 

BCL2/adenovin)s E1B 19kD4nterading protein 2 
ESTs 

N-mynstoyltransferase 2 

hypothetical protein 

ESTs 

ESTs 

ESTs 

ESTs, Moderately similar to similar to NEDEM [H.saplens] 

hypothetical protein FLJ20616 

ESTs 

ESTs 

ESTs 

Zic family menter 3 (odd-paired Diosophila homolog. heterotaxy 1) 

CGI-94 protein 

ESTs 

ADP-ribosylatlon factor-like 5 

CDC14 (ceil division cyde 14, S. oerevlsiae) homolog B 

ESTs 

phosdudn-IIke 

hypothetical protein MGC10947 

KIAA0328 protein 

hypothetical proteon 

ESTs 

ESTs 

chromosome 21 open reading frame 6 

Homo sapiens cDNA: FU21869 fis. done HEP02442 

UIVIP-CI\AP kinase 

hypothetical protein FU14825 

ESTs. WeaWy simflar to ALU8^HUMAN ALU SUBFAIVIILY SX SEQUENCE CONTAMINATION 

gb:zt74e03.r1 Soares Jestis.Nlfr Homo sapiens cDNA done 5\ mRNA sequence 
SWI/SNF related, matrix assodated. actin dependent regulator of chromatin, subfeimily e. 

Homo sapiens mRNA; cDNA DKFZp58601318 (from done DKFZp58601318) 

ESTs 

core-binding factor, lunt domain, alpha subunit 2; translocated to, 1; cydin D-retatsd 
centromere protein A (17kD) 
hypothetical protein FU22637 



Hs.301532 

Hs.103000 

Hs.88780 

Hs.102708 

Hs.104520 

Hs.31802 

Hs.179864 

Hs.295726 

Hs.26814 

Hs.42241 

Hs,300383 

HS239114 

Hs^34863 

Hs^7939 

Hs.22464 

Hs.1098 

Hs.65732 

Hs.78277 

Hs.31697 

Hs.281348 

Hs.97849 

Hs^89101 

Hs^4594 

Hs.274e91 



A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761J1112 (from done DKFZp761J1112) 
mate-specific lethaI-3 (Drosophna)-nke 1 
Janus kinase 1 (a protein tyrosine kinase) 

sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

gb:zm97e10^1 Stratagene colon HT29 (937221) Homo saptens cDNA done 3*. mRNA 

CRP2 binding protein 
KIAA0831 protein 
ESTs 

DKF2P434A043 protein 

Homo sapiens cDNA FU13694 fis. done PU\CE2000115 

ESTs. H^hly simiter to A59266 unconventional myosin-15 [H.sapiens} 

ESTs 

integrin, alpha V (vitronectin receptor, alpha polypeptkle, antigen 0051) 

ESTs 

ESTs 

hypothetical protein MGC^032 

roannosidase, alpha, class 1 A, member 2 

Homo sapiens cDNA FU12082 fts. done HEMBB1002492 

cathepsin B 

epitepsy, progressive myodonus type 2, Lafora disease (teforin) 

gb:Kuman endogenous retrovirus K done 10.1 polymerase mRNA, partial ods 

DKFZp434J1813 protein 

ESTs 

DKFZP434F2021 protein 
ESTs 

hypottietical protein FU10895 
ESTs 

glucose regulated protein, 58kD 

ubiquitinatten factor E4B (homologous to yeast UFD2) 

adenytete kinase 3 
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112888 RCJ03872 AW195317 Hs.107716 

115192 RCj^A261920 M741024 Hs.88378 

118688 RC_N71484 AK000708 Hs.169764 

122264 RC_AA436837 AA436837 

128981 AA135452 AA927177 Hs.86041 

131042 RC^R42457 A1826288 Hs.171637 

103704 AA028171 AA028171 Hs.151258 

121341 AA233107 AF035528 Hs.153863 

106593 RC.AA456826 AW296451 Hs.24605 

115195 RC_AA262156 AW968619 Hs.155849 

115425 RCJA284071 AA611895 Hs.180680 

117258 RC_N21299 AF086041 Hs.42975 

120209 RC_Z40892 F02951 
sequence 

134082 L16991 L16991 

RC.AA026066 AW959755 

RCJWV401630 AA059459 

N28707 N28707 

W20054 N91453 

RC_R21510 R22039 

RC^AA280026 AA280095 

RC^AA282292 AA282292 

RC_H97819 N41549 

M95767 AI269096 

AA004987 AL1 37275 

144574 NM.007331 

RC_T26981_s NM_002267 

RC_R36083 R36095 

RC_AA252163 AA463550 

RC^AA252219 AL137939 

H81492 BE246400 

R24011 R24024 
PROTEIN 91 

116167 RCJ\A461562 AI091731 

AA236771 R85350 

RC.H48459_s AA3559» 

RC_N80181 N80181 

RC_W93092 BE149656 

D21852 NM_015361 

U41815 AF071076 

AA0B1B76 AA075000 
sequence 

132019 RC_AA134965J H56995 

132310 RC_AA284107 AA173223 

117367 RC_N24954 AI041793 

103743 AA075998 AA075998 

gb:M15887 ACYL-COA-BINDING PROTEIN (HUMAN);. mRNA sequence 



104774 
115625 
104469 
107401 
111686 
115300 
115378 
132224 
113791 
129144 
104448 
132084 
111831 
114765 
115029 
100457 
104536 



103889 
131978 
118843 
120837 
133647 
129521 
103746 



Hs.79006 

Hs.288896 

Hs.62592 

Hs.154304 

Hs.102987 

Hs^3217 

Hs.88689 

Hs.279841 

Hs.285410 

Hs.135578 

Hs.20137 

Hs.1 10457 

Hs,3B86 

Hs.26d695 

Hs.337532 

Hs.40096 

Hs^85176 

Hs.158101 

Hs.87293 

Hs.101388 

Hs.36232 

Hs.221498 

Hs.306621 

Hs268053 

Hs.112255 



Hs.37372 

Hs.289044 

Hs.42502 



hypothetical protein FLJ22344 
ESTs 

hypothetical protein FU20701 

gb:zv57g07^1 Soares_testis__NHT Homo sapiens cDNA done 3*. mRNA sequence 
CGG tnplet repeat binding protein 1 
hypothetical protein MGC2628 
hypothetical protein FLJ21062 

MAD (mothers against decapentaplegic. Drosophila) homoiog 6 
ESTs 
ESTs 

ESTs, Weakly similar to 154374 gene NF2 protein IK.sapiens] 
ESTs 

gb:HSC1HB082 nomialized infant brain cDNA Homo sapiens cDNA done c-1hb08 3'. mRNA 

deoxythymidylate Idnase (thymidylate kinase) 
Homo sapiens bDNA FU 12977 fts, done NT2RP2006261 
ESTs 

Homo sapiens chromosome 19. BAC 262485 (CIT-&-344H19) 
ESTs 
ESTs 
ESTs 

hypothetical protein FU10335 
ESTs 

chitobiase, d»-N-acetyl- 
hypothetical protein DKFZp434P0116 
Wolf^iirschhom syndrome candidate 1 
loryopherin alpha 3 (importin alpha 4) 
ESTs 

ESTs, Weakly similar to A47682 B-ceil growth fedor precursor [H^apiens] 
ESTs 

acetyl-Coenzyme A transporter 

Homo sapiens cDNA FU14673 fis, done NT2RP2003714. moderately similar to ZINC FINGER 

hypothetical protein FLJ20045 
ESTs 

KIAA0186 gene produd 
ESTs 

Homo sapiens cDNA FU1 1983 tis, done HEMBB1001051 
KIAA0029 protein 
nucieoporin98kD 

gb:zm83o07.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 3', mRNA 

Homo sapiens DNA binding peptide mRNA. partial cds 
Homo sapiens cDNA FLJ12048 fis, done HEMBB1001990 
ESTs 

gb:zm89b09.r1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 5* similar to 



103761 AA085138 AA765163 
BINDING PROTEIN (HUMAN):. mRNA sequence 



L39060 
RC_N72879 
AA045930 
AA096412 
RC„AA599244 
AA293634 
H06583 
U79298 
23803 mRNA 
118612 RC_N69466 
322026 AA203138 
110892 RC.N38882 
RC^R01245 
RC_T76962 
AA417310 
RCJ^A226879 



AA913909 
AA504428 
A1187925 
BE219898 
AL044675 
W73933 
NM_001310 
AW294659 

AB037788 
AW024973 
AL035301 
AI038052 
AW974666 
6E465093 
AA226879 



Hs.153088 

Hs.10487 

Hs.95667 

Hs.173135 

Hs.173081 

Hs.283738 

Hs.13313 

Hs.34054 

Hs^4961 
Hs^83675 
Hs.97375 
Hs.19162 
Hs^93024 
Hs.106101 



gb:n279b1 0.s1 NCLCGAP.GCB 1 Homo sapiens cDNA done 3' similar to gb:M34539 FK506- 

TATA box binding protein (TBP)-associated fedor. RNA polymerase i, A. 48kD 
Homo sapiens, done iMAGE:3954132, mRNA. partial cds 
F-box protein 30 

dual-spedfidty tyrosine-(Y)-phosphorytafion regulated kinase 2 

K1AA0530 protein 

casein kinase 1, alpha 1 

cAMP responsive element binding protein4i)ce 2 

Homo sapiens cDNA: FU22488 tis. done HRC10948, highly sImRar to HSU79298 Human done 

cleavage and polyadenylation spedTtc factor 2. ICiOkD subunit 
NPO009 protein 

H^piens gene from PAC 108H8 
ESTs, Weakly similar to 154374 gene NF2 protein {H.sapiens] 
ESTs 

hypothetical protein FU22557 

gb:zr19c09^1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 



130237 
128752 
135162 
131386 
129021 
424274 
129913 
131888 



111429 

113334 
104091 
105246 

IMAGE:663856 3' similar to contains Alu repetitive element;, mRNA sequence. 

113300 RC_T67448 T67448 Hs.13101 ESTs 

117147 RC_H97225_s AW901347 HS.3B592 hypothetical protein FU23342 

121349 RC^AA405205 AA405205 Hs.97960 ESTs. Weakly similar to T51 146 ring-box protein 1 [H.sapiens] 

100294 D49396 AA331881 Hs.754M peroxiredoxin 3 

133999 M28213 AA535244 Hs78305 RAB2. member RAS oncogene family 

133259 AA278548 BE379646 Hs.6904 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 2004403 

129423 AA371418 AA204686 Hs.234149 hypottietical protein FU20647 

131098 RCJVA459668 U66669 Hs.236642 3-hydroxyisobutyiyl-Coenzyme A hydrolase 

135272 AA399391 AI828337 Hs.97591 ESTs 

129155 AA046865 AI952677 Hs.108972 Homo sapiens ntRNA; cDNA OKFZp434P228 (from done DI^Zp434P228) 
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311291 AA056319 AA782601 

120750 RCJ^310499 AI191410 

101002 J04058 AV655843 

133012 AA099241 AA847843 

103879 AA228148_S BE543269 

131281 RC_AA443212 AA251716 

115109 RCJVA256383 AJ249977 

118502 RC^N67317 AL157488 

134100 L07540 AA460085 

131869 AA484944 AW968547 

115396 RC_AA282985 AA810854 

103660 AA203742 AW976877 

135089 N75611.S AI918035 

129938 U79300 AW003668 

107508 W90095 N74925 

103685 AA005190 AA158008 

125170 AA203147 AL020996 
129179 RC AA504125_s AW969025 

116262 AA477046 AI936442 

123009 RCJVA479949 AA535244 

131004 D29833 029833 

103317 X83441 X83441 

132814 RC_C15251J D60730 

103992 U77718 BE018142 

109258 X59710 AL044818 

110754 RC.N20814 AW302200 

132727 AA136382.S N27495 

100341 D63508 AF032922 

134664 AA2561Q6 AA256106 

103826 AA165564 AW162998 

111678 RC^R20628 R38487 

101341 L76159 NML004477 

115455 RC_AA285068 AA876002 

111192 RC_AA477748 AW021968 

129385 RCJ\A235604 AA172106 

125050 RC_T79951 AW970209 

122105 RC.AA432278 AW241685 

121324 RCJ^404229 AA404229 

120938 RCuAA386260 AA386260 

115001 RCJ^51376 AA251376 
sequence. 

124799 RC^R45088 R45088 
sequence. 

122724 RC^AA457395 AA457395 
117791 RC»N48325 N48325 
121895 RC_AA427396 AA427396 



Hs.319817 ESTs 

Hs.96693 ESTs. Moderately similar to 210926QA B ceQ growth factor [H^piens] 

Hs.169919 electron-transfer-flavoprotein. alpha polypeptide (glutaric aciduria II) 

Hs.62711 Homo sapiens, done IMAG£:335129S, mRNA 

Hs.50252 mitochondrial ribosomal protein L32 

Hs.25227 ESTs 

Hs.88049 protein kinase, AMP-acCivated. gamma 3 non-catalyfic subunit 

KS.S0150 Homo sapiens mRNA; cDNA DKFZp564B162 (from done OKFZp584B182) 

Hs.171075 replication factor C (activator 1) 5 (3B^kD) 

Hs.33540 ESTs, Wealdy simitar to dJ309K20.4 [Haptens] 

Hs.89081 ESTs 

Hs.38057 ESTs 

Hs.301 198 roundat)out (axon guidance receptor. OrosophOa) homolog 1 

Hs.135587 Human done 23629 mRNA sequence 

Hs.38761 Homo sapiens cOI^IA: FU21564 fis. done C01J)6452 

Hs.292444 ESTs 

Hs.8518 selenoprotein N 

Hs.109154 ESTs 

Hs.59838 hypothetical protein FU10808 

Hs78305 RAB2, meml)er RAS oncogene family 

Hs.2207 salivary proOne-rich protein 

Hs.166091 li9aselV.DNA.ATP-dependent 

Hs.57471 ESTs 

Hs.300954 Huntingtin interacting protein K 

Hs.84928 nudear transcription fador Y, beta 

Hs.6336 KIAA0672 gene produd 

Hs.5565 hypothetical protein FU22626 

Hs.881 3 syntaxin binding protein 3 

Hs.87507 ESTs 

Hs.24684 KIAA1376 protein 

Hs.169927 ESTs 

Hs.203772 FSHD region gene 1 

Hs.120551 toU-fOce receptor 10 

Hs.109438 Homo sapiens done 24775 mRNA sequence 

Hs.1 10950 Rag C protein 

Hs.111805 ESTs 

H5.96699 ESTs 

Hs.97842 EST 

Hs.104632 EST 

gb2s10a06.s1 NCLCGAP^GCBI Homo sapiens cONA done IMAGE:684754 3*. mRNA 

gb:yg38g04^1 Soares infant brain 1NIB Homo sapiens cDNA done IMAGE:34896 3', mRNA 

Hs.99457 ESTs 
H5.93956 EST 

gb:zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDI^ done IMAGE:771050 3* 



similar to contains Alu repetitive elementcontains i^R12.t2 MER12 repetitive element ;, mRNA sequence. 



108244 RCJ^062839 
3*. mRNA sequence. 
117852 RC_N49408 
109298 RC^AA205432 
122432 RC.AA447400 
124627 RC_N74625 



AA062839 gb:2m05c09.s1 Stratagene corneal stroma (937222) Homo sapiens cDNA done IMAGE:513232 

AW877787 Hs.136102 KiAA0853 protein 

R77854 Hs.250693 Kmeppekelated zinc finger protein 

AA447400 Hs.187684 ESTs. Weakly similar to B34087 hypothetical protein (H.sapiensl 

N74625 gb:2a55c03.s1 Soares fetal Over spleen 1NFLS Homo sapiens cDNA done IMAGE298452 3* 



simflar to gb:l\/i14338 VITAMIN K-DEPENDENT PROTEIN S PRECURSOR {HUMAN);contains 0FR.b3 OFR repetitive element ;, mRNA sequence. 



115141 RC_AA258071 AA465131 Hs.64001 
128636 U49065 U49065 Hs.102665 

115373 RCJ^2197 AA664862 Hs.181022 
114651 RCJ^A101400 AA101400 Hs.189960 
132796 RCJ^180487 NM_008283 Hs.173159 
103749 RC.N35583 AL135301 Hs.6768 
107328 T83444 AW959891 Hs.76591 

115349 RC_AA281563 AF121176 Hs.12797 
111490 RC.R06862 R06862 
similar to contains LI repetitive element ;, mRNA sequence, 
103763 AA085354 AA085291 
contains Alu repetitive element;, mRNA sequence 



Homo sapiens done 25218 mRI^ sequence 
mterleukin 1 reoeptor-Glce 2 
CGI-07 protein 
ESTs 

transfomning, addic odied-ooil containing protein 1 
hypothetical protein FLJ10849 
KIAA0887 protein 

DEAD/H (Asp^hKAIa^p/Hls) box polypeptide 16 

gb7tt1e095l Soares fetal liver spleen 1NFLS Homo sapiens cONAdone IMAGE:126568 3* 

3. 

gb:2n01g06.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3' similar to 



118791 RC_N75520 N75520 

116644 RC.F03032 F03032 

116823 RC_H56485 AW204742 
[H sapiens] 

108940 RC_AA148603 AA148603 
IMAGE:567198 3\ mRNA sequence. 

112218 RC_R50057 R50057 

116557 RC_D20572_i D20572 

133649 U25849 U25849 

131745 RC.C20746 AI8285S9 



Hs.261003 ESTs. IVloderately simflar to B34087 hypothetical protein iH.sapiens] 

Hs.290278 ESTs. Wealdy simQar to B34087 hypothetical protein [H.sapiens] 

HS.143M2 ESTs, Highly similar to CSA.HUMAN COCKAYNE SYNDROME Wt>«EPEAT PROTEIN CSA 

gb2o09^.s1 Stratagene neuroepithelium NT2RAIV11 937234 Homo sapiens cONA done 

Hs.272251 Homo sapiens mRNA; cDNA DKFZp586M1418 (from done DKFZp586M1418) 

Hs.90171 EST 

Hs.75393 add phosphatase 1 . sotuble 

Hs.31447 ESTs, t^eiately simQar to A48010 X-linked refinopalhy protein [H^tens] 
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AA251548 

H29882 

AA347919 

AA018298 

AF025771 

N67946 

AB020700 



BE276055 

R66740 

NM^003478 

R40697 

R73588 

AF255910 

AW137912 



116801 RC_H43879 H43879 
sequence. 

115006 RC^AA251548 
123424 RC_AA598500 
5 120831 RCJVA347919 

103691 AA018298 
121555 RCJiA412491 
111193 RC_N67946 
132061 RCJ\A058946 

10 134575 RCJ\A194568J AA194568 

115050 RC_AA252794 AA252794 

420208 U31799 

133735 AC002045j(pt1 

128546 Z21305 
15 111946 RC^R40697 

124879 RC_R73588 

115683 AA410345 

103692 AA018418 

(CACNA1F) gene, complete cds; HSP27 

20 103767 AA089688 BE244667 

125266 W90022 W90022 
PRECURSOR [H .sapiens] 

135235 AA435512 AW298244 

134497 RC_AA404494 BE258532 

25 426754 RC_AA278529J NM_014264 

412177 RC_AA342828_s Z23091 

132000 RC_AA044644 AW247017 

124738 RC^AA044644 T07568 

324000 RC.AA196729J AA604749 

30 106896 RC_AA196729J AW073202 

132000 RC_AA025858 AW247017 

129577 RC_AA025858 N75346 

107091 RC_AA233519 AI949109 

130296 RC_N52271 D31139 

35 102855 RC_N68399 NM_003528 

113689 RC^AA098874 AB037850 

100939 RC^AA279667 s L04288 

130430 RC_H22556 W27893 

106734 RC^N45979_s BE296690 

40 intersectin 2 long Isofomi (ITSN2) mRNA 

135148 RC_AA431288_s AA306478 

134221 RC_AA609862 BE280456 

105376 RC,N35583 AW994032 

124541 U77718 API 12222 

45 134546 AA203147 AL020996 

134000 RC_W93092 AW175787 

125656 RC_W93092 AW516428 

100939 RC_N58561.s L04288 

125656 RC_W93092 AW516428 

50 101779 RC.W69385.S BE543412 

332489 RC.R22947 R23053 

133000 RC«N38959J AL042444 

125905 RC.N38959J AI678638 

129000 RC_H73050.s AA744902 

55 100920 RC_H73050_s X54534 



gb:yo69h09^1 Soaies breast 3NbHB8t Homo sapiens cDNA done IMAGE:183233 3*. mRNA 

Hs.87886 EST 

Hs.162614 ESTs 

Hs.96889 EST 

Hs.103332 ESTs 

Hs.50123 zinc finger protein 189 

Hs.1 17569 ESTs 

Hs.3d30 KtAA0893 protein 

Hs.85938 EST 

Hs.88009 ESTs 

Hs.95972 silver (mouse homotog) lilce 

Hs.1 10613 KIAAQ220 protein 

IHs.101299 cullinS 

Hs.76666 C9orf10prolBin 

Hs.101533 ESTs 

Hs.54650 junctional adiieslon molecule 2 

Hs.227583 Homo sapiens chromosome X map Xp1153 L4ype caldum channel alpha-1 subunit 
pseudogene, complete sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete cds 

Hs^96155 CGMOO protein 

Hs.186809 ESTs. Highly similar to LCT2_HUMAN liUKOCYTE CELL-DERIVED CHEIWOTAXIN 2 

Hs.293507 ESTs 

Hs.251871 CTP synthase 

Hs.172052 serine/threonine Idnase 18 

Hs.73734 glycoprotein V (platelet) 

Hs.36978 melanoma antigen, family A, 3 

Hs.1 37158 ESTs 

Hs.190213 ESTs 

Hs.334825 Homo sapiens cDNA FU14752 fis, clone NT2RP3003071 

Hs.35978 melanoma antigen, family A, 3 

Hs.82906 CDC20 (cell division cyde 20, S. cerevisiae. homolog) 

Hs.246885 hypothetical protein FU20783 

Hs.154103 LIM protein (similar to rat protein kinase C-binding enigma) 

Hs.2178 H2B histone family, member Q 

Hs.16621 DKFZP434I116 protein 

Hs.297939 cathepsin B 

Hs.150580 putative translation initiation factor 

Hs.288173 Homo sapiens cDNA: FLJ21747 fis. done COLF5160. highly similar to AF182198 Homo sapiens 

Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

Hs.80248 RNA-bindlng protein gene with multiple splicing 

Hs.8768 hypothetical protein FU10849 

Hs.44499 pinin, desmosome associated protein 

Hs.8518 selenoprotein N 

Hs.334841 selenium binding protein 1 

Hs.78687 neutral sphingomyelinase (N-SMase) activation associated factor 

Hs.29793g cathepsin B 

Hs.78687 neutral sphingomyelinase (N-SMase) activation associated factor 

Hs^50505 reSnoic add receptor, alpha 

NA HuOIChipRedos 

Hs.62402 p21/Cdc42/Rac1-actfvated Icinase 1 (yeast Ste20-related) 

H5.6456 chaperonin containing TCP1, subunit 2 (beta) 

Hs.107767 hypothetical protein PR01489 

Hs.278994 Rhesus btood group. CcEe antigens 
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TABLE 1 A 

Table 1A shows the accession numbers for those pkeys tacking untgenelD*s for Tables 1 The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have Ftsted the gene duster number from which the oligonudeotides were designed. Gene dusters were compiled 
S using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 

Alignment Tools (DoubteTwist. Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the 'Accessbn* 
column. 



1 0 Pkey: Unique Eos probeset identifier number 

CAT number Geiie dusternumber 
Aocesston: Genbank accession numbers 



15 Pkey CAT Number Accession 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



108469 116761J AA079487 AA128547 AA128291 AA079587 AA079600 
124106 125446 1 H 12245 AA094769 R14576 
108501 13684.-12 AA083255 

108562 36375J AA1 00796 AF020589AA074629AA075946AA100849AA085347AA126309AA079311 AA079323 AA085274 

125008 1802095 1 T91251 T64891 T85665 

125020 116017 J T69981T69924AA078476 

125066 1814993.1 T86284 T81933 

116661 1532859.1 R61504 F04247 

125104 413347.1 T95590 AA703278 H62764 

124575 1666649.1 N68168 N69188 N90450 

125263 1547 2 AA098878 W88942 

116845 393481.1 AA649530 AA659316 H64973 

1 18417 37186_1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 

At633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW208B02 AI970376 AI583718 AI672574 
N25695 AW665466 AI816326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW4g6808 A1080480 AI631703 At651023 A18G7418 AW818140 AA502500 At208199 AI871282 A1352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA7033g6 H92278 AW139734 H92683 U87589 U87595 H69001 U875g4 BE466420 A1624817 BE46661 1 AI206344 AA574397 
AA3483S4 AI493192 

118584 532052.1 AW1 36928 AI685655BE218584BE465D78N68963AA975338BE1471 99 N76377 

103743 112194 1 AA075998 AA0759S9 AA070986 AA070896 AA129207 AA078942 AA070783 AA078941 

103744 114161.1 AA079267AA076003 
103746 113452.1 AA075000 AA081876 

103761 1 14208.1 AA765163 AW298222 AA126126 AA085138 AA076088 

103763 48290.6 AA085291 AA085354 

120209 1531817.1 F02951 Z40892 F0471 1 

120284 158963.1 AA1 79656 AA1 82626 AA1 82603 

1 12540 1605263.1 R69751 R70467 H69771 H80879 H80878 

111904 1719336.1 Z41572R39330 

121059 273450.1 AA393283 AA398628 

121094 275729.1 AA402505 AA398900 

114106 1182096 1 AW602528 BE073859Z38412 

130091 23951.-3 W88999 

122264 295527.1 AA436837 AA442594 

108280 110682.1 AA065069 AA085108 

129961 1706092.1 R23053 R79884 R76271 

130529 158447.1 AA1 78953 AA1 92740 

AA069818 AA069971 AA069923 AA089908 
AA021473 
AA609839 
D45652 
AA128980 
T68875 
T85352 
T85373 



108309 111495.1 
107832 genbanK_AA021473 
123731 genbanK.AA609839 
116571 genbanleD45652 
132225 genbank.AA128980 
125017 genbank_T68875 

125063 genbanK_T85352 

125064 genbank.T85373 
100964 entfe^J00212J00212 

125118 149288J R10606 T97620 AA576309 
102269 entrez^U30245U30245 
125160 NOT_FOUND.entTe?_W38240 W38240 
116801 genbanK.H43879 H43879 
118111 genbank.N55493 N55493 
118129 genbanK.N57493 N57493 
118329 genbank N63520 N63520 
118475 genbanK_N66845 N66845 
111490 genbanK.R06862 R06862 
111514 genbanleR07998 R07998 
104534 R22303.at R22303 
120340 genbanKJ^206828 AA206828 
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10 



15 



20 



25 



30 



35 



40 



45 



120376 genbanIeAA227469 
104787 genbanK^AA027317 
120409 genbanK^AA235050 
120745 genbankJ\A302809 
120809 genbanK_AA346495 
120839 genbanleAA348913 
113702 genbanK_T97307 
115001 genbanK_AA251376 
122562 genbanlLAA452156 
122635 genbanlLAA454085 
108244 genbanluAA062839 
108277 genbanlL.AA064859 
122723 genbank^AA467380 
124028 genbank_F04112 
108403 genbanLAA075374 
122860 genbankJiiA464414 
108427 genbankj^076382 
108439 genbanK_AA078986 
131353 231290J 
AA969360 

108533 genbanLAA084415 
117031 genbank^H88353 
124254 genbanleH69899 
101447 entrez_M21305 
101458 entrez^M22092 
124577 genbank_N68300 

108940 genbank AA148603 

108941 genbank.M148650 
124627 genbanK.N74625 
124720 144582_1 
124793 genbanK.R44519 
124799 genbanK.R45088 
117683 genbank_N40180 
117732 genbank_N46452 
124991 genbanK„T50116 

genbanK_N98488 
95573.2 



AA227469 

AA027317 

AA235050 

AA302809 

AA346495 

AA348913 

T97307 

AA251376 

AA452156 

AA454085 

AA082839 

AA064859 

AA457380 

F04112 

AA075374 

AA464414 

AA076382 

AA078986 

AW411259 H23555 AW015049 A1684275 AW015886 AW068953 AW014085 AI027260 R52686 AA918278 AI129462 

N34869 AI948416 AA534205 AA702483 AA705292 

AA084415 

H88353 

M21305 
M22092 



119023 
119239 
119558 
119654 
105246 



N68300 
AA148603 
AA148650 
N74625 

R05283R11056 

R44519 

R45088 

N40180 

N46452 

T50116 

Ng8488 

T11483T11472 



NOT.FOUND_entre2_W38194 W38194 



genbanK_W57759 W57759 

genbanlL.AA226879 AA226879 

121350 genbank_AA405237 AA405237 

121558 genbanK»AA412497 AA412497 

105985 genbank_AA406610 AA406610 
100071 entre^A28102A28102 

114648 genbank^AA101056 AA101056 

121895 genbank^AA427396 AA427396 
100327 entre^D55640D55640 
123315 714071.1 AA496369 AA496646 
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TABLE 2: 



Pkey-. Unique Eos probeset idenlifter number 

5 Accession: Accession nuntber used for previous patent filings 

ExAocn: Exemplar Accession number. Genbanic accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

10 Pkey Accession ExAccn UnigenelD UnigeneTitle 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



100420 
100484 
100991 
101168 
101261 
101447 
101543 
101560 
101714 
101838 
102012 
102164 
102263 
102564 
102759 
102804 
102898 
103036 
103095 
103166 
103280 
103850 
104592 
104786 
104865 
104952 
105178 
105330 
105729 
105977 
106031 
106155 
106423 
107174 
107295 
108756 
108888 
109166 
109768 
110906 
111006 
111133 
113073 
113923 
115061 
115145 
115947 
116339 
116589 
117023 
117563 
118475 
119073 
119174 
119416 
121335 
123160 
123523 
123964 
124315 
124669 
124875 
125103 
125565 



100420 
100484 
100991 
101168 
101261 
101447 
101543 
101560 
101714 
101838 
102012 
102164 
102283 
102564 
102759 
102804 



103036 
103095 
103166 
103280 
103850 
104592 
104786 
104865 
104952 
105178 
105330 
105729 
105977 
106031 
106155 
106423 
107174 
107295 
108756 
108888 
109166 
109768 
110906 
111006 
111133 
113073 
113923 
115061 
115145 
115947 
116339 
116589 
117023 
117563 
118475 
119073 
119174 
119416 
121335 
123160 
123523 
123964 
124315 
124669 
124B75 
125103 
125565 



086983 Hs.1 18893 

NIVL005402 Hs.288757 

J03836 Hs.82085 

NM»005308 Hs.211569 

D30857 Hs.82353 
M21305 

M31166 Hs.2050 

AW958272 Hs.347326 

M68874 Hs.211587 

BE243845 Hs.7551l 

BE259035 Hs.1 18400 

Nf^000107 Hs.77602 

AW161552 Hs.83381 

U59423 Hs.79067 

NIVL005100 Hs788 

NIi^002318 Hs.83354 

Nl^002205 Hs.149609 

M13S09 Hs.83169 

NM.005424 Hs.78824 

AA159248 Hs.180909 



U84722 

AA187101 

AW630488 

AA027167 

T79340 

AW076098 

AA313825 

AW338625 

H46612 

AK001972 

X64116 

AA425414 

AB020722 

BE1 22762 

AA1 86629 

AA127221 

AA135606 

AA219691 

F06838 

AA035211 

BE387014 



HS.76206 

Hs.213194 

Hs.25338 

Hs.10031 

Hs.22575 

Hs.345588 

Hs.21941 

Hs.22120 

Hs.293815 

Hs.30822 

Hs.171844 

Hs.33287 

Hs.16714 

Hs.25338 

Hs.80120 

Hs.1 17037 

Hs.189384 

Hs.73625 

Hs.14763 

Hs.17404 

Hs.166146 



AW580939 Hs.97199 
N39342 Hs.103042 



AW953484 
AI751438 
AA740907 
R47479 
AK000290 
AI557212 
AW070211 
AF055634 
N66845 
BE245360 
R71234 
T97186 
AA404418 
AA4B8687 
AA6a8588 
C13961 

NMJ005402 Hs.288757 
AI571594 Hs.102943 
AI887664 Hs^5814 
AA570056 Hs.122730 
R20840 



Hs.3849 

Hs.41271 

Hs.88297 

Hs.94761 

Hs.44033 

Hs.17132 

Hs.102415 

Hs.44553 

Hs.279477 



H5.284235 



Melanoma associated gene 
v-ral simian leukemia viral oncogene horn 
serine (or cysteine) proteinase lnh3)ito 
G protein-coupled receptor kinase 5 
protein C receptor, endothetlal (EPCR) 
gbiHuman alpha satellite and satellite 3 
pentaxin-related gene, rapkily induced b 
intencelluter adhesbn molecute 2 
phosphoDpase A2. group IVA (cytosollc, 
connective tissue growth factor 
singed (DrosophilaKike (sea urchin fas 
damage-specific DNA binding protein 2 (4 
guanine nucleotide bbiding protein 11 
MAD (mothers against decapentaplegte. Dr 
A kinase (PRKA) anchor protein (gravin) 
lysyloxidase-like2 
Integrin. alpha 5 (fibronectin receptor, 
matrix metalloproteinase 1 (interstitial 
tyrosine kinase with bnmunogtobulin and 
peroxiredoxin 1 

cadherin 5. type 2. VE-cadherin (vascula 
hypothetk:aI protein MGC10895 
protease, serine. 23 
KIAA0955 protein 

B-cell CUJymphoma 6. member B (zinc fi 
desmoplakin (DPI. DPil) 
AD036 protein 
ESTs 

Homo sapiens HSPC285 mRNA, partial cds 
hypothetica! protein FUIIIIO 
Homo sapiens cDr4A: FL122296 fis. ctone H 
nuclear factor l/B 

Rho guanine exchange fedor (GEF) 15 
ESTs 

UDP-N-aoe^V^a-D-gaiactosanUne:polyp 
ESTs 

gb;zl10805.s1 Soaresj)regnanLutenjs_NbH 
RAB6 interacting, kinesin-like (rabkines 
ESTs 
ESTs 

Homer, neuronal immediate early gene, 3 
complement component Clq receptor 
mlcrotubule^ociated protein 16 
hypothetica) protein FU22041 similar to 
Homo sapiens mRNA full length insert cDN 
ESTs 

KIAA1691 protein 

dipeptidyl peptklase 8 

ESTs, Moderately similar to 154374 gene 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 

unc5 (Celegans homolog) c 

gb:za46c1 1^1 Soaies fetal Bver spleen 

ESTs 

gb7'64c08.s1 Soares placenta Nb2HP Homo 
gb7e50h09.s1 Soares fetal liver spleen 
gb:zw37e02.s1 Soares«totaLfetus.Nb2HF8_ 
ESTs. Weakly similar to 138022 hypotheti 
gb2e54e06.s1 Stiatagene lung carcinoma 
gb:C13961 Clontech human aorta polyA-t-mR 
v-ral simian leukemia viral oncogene horn 
hypothefical protein MGC12916 
sprouty (Drosophlla) homokjg 4 
ESTs. Moderately similar to KIAA1215 pro 
gb:yg05o08n Soares infant brain 1NIB H 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 

SNSDCXrO: <WO_ 



126511 


126511 


TOO') 40 


ns.o/yoo 


126649 


126649 




Uc 97Q<\11 

ns>«cf youi 


449602 449602 


AAnniARn 










He 997QAQ 


iOAQQO 


4 00QQ0 


nu*r 191/ 


ns* lu/ /uo 




IZ9IOO 




He inQ99R 


l£OOf 1 






He iinfln9 




1Z9760 




He 19^ 




4 oooo>i 




He 

ns* i«3 iw 1 




ldOD09 




He 171^9 
no* If 10& 




4 0nD00 


MVVDO It 09 


He 9n^91^ 


4 O 4 AO A 

131080 


131080 


MM nniQR<; 


He 9971 




131182 


AIA9il1^ 


He 9^019 


131573 


131573 




Ue 9AQRQ 

ns.zoyoy 


131756 


131756 


A A^^OQCR 


ns.tSioyo 


131881 


131881 


AtA/QCiniA 
MWJD lUlO 


ns.oooo 


132083 


132083 


DCOOQHgU 


Lio 07OCC^ 


132o5o 


132358 




He ARAO'i 


4 00XCC 


132456 




He ZAQ9^ 


1o2o7d 


132676 




He Oftinm 


132718 


132718 


KIM nn^RHD 


He l\f^ 


132760 


132760 


AA19'5QA«? 
r\r\ 1 ^usOu 


He f\MAf\ 


4 oonco 
1o29oo 


132988 


AF9'W.«;'^9 


He ftlR^A 


133061 


133061 


AI1Rf>A'3i 


ns^yoooo 


133161 


133161 


Awn9im'5 

AWUZ 1 lUO 


He CA^i 

ns.oDoi 


133260 


133260 


A A^n^n/iR 


Ue cone 
nS-byuo 


133491 


133491 


□boiyuoo 


u» 47nAn4 


133550 


133550 




Um 7iieco 


133614 


133614 


KIM finvm'} 
NiVLvUoUUo 




133691 


133691 




Ub 04'tC71 


133913 


133913 


Al in7fiQA^ 


Ue 77^1 


133985 


133985 




riS.fo14b 


134088 


134088 


A 177001;^ 


Ue 70A91« 


134299 


134299 


A\A/CQnO'}Q 

Awoouyoy 


ns.y/ lyy 


118470 


116470 


A 10704 ^4 




134989 


134989 


A\A/QRQni?0 

Mwyoouoo 


Uo 00004 


135073 


135073 




nS.y4UoU 


100114 


100114 




Ue fiOQAO 


100143 


100143 


Al tfY7CiACiK. 

AUU/040O 


Uo 07Q/l^4 

nS.z/o441 


100208 


100208 




Uo 7fiOOyl 


100405 


100405 


AIA/004 CO*7 

AWiJyl oo7 


Hs.o2733 


100455 


100455 


AYftrQQDQX 4 

AWoooy4l 


nS./o/oy 


100618 


100618 


AI7C04CO 


Uo 4 4>ICQQ 

HS. 114599 


100658 


100658 


UOD r 41:0 


Uo 'ffln>i4>i 
nS.loU414 


100718 


100718 




Uo TCA'iA 

nS./o4z4 


100828 


100828 


Al A4ft7Ca 




100991 


100991 


JUJOOO 


Uo oonoc 


101110 


101110 


MI4oyU11 


Uo Qcooe 


101156 


101156 


AA'^nQA7 


Ue 7RAQQ 

ns./ODyo 


101184 


101184 


MM nniR7^ 


U* A(U\ 


101317 


101317 


1 A917A 




101345 


101345 


KIM nniP7iu; 


Uo 4eo<i7e 


101475 


101475 


Del IUhUO 


Uo 70000 


101496 


101496 


Y197A^ 

AlZ/o4 


nS.1 19129 


101543 


101543 


(L /I04 4CC 


Hs^050 


101560 


101560 


A\A/QC0070 


ris.3473Zo 


101592 


101592 


Acnc/Qco 

ArUD4oOJ 


nS.9i299 


101634 


101634 


Av/ecnoco 


Hs.75765 


101682 


101682 


ArUHoU40 


Uo D4nnQ 


101720 


101720 


IViOaUHO 


Uo 04 000 


101744 


101744 


MiO t yooz 


Uo 4 4QCOC 


101837 


101837 






101840 


101840 


A AOqROQ4 


Uo lOOKOO 


101864 


101864 


OC09ZO00 


Uo 7R777 

ns./o/ / / 




luiyoD 




nS.f DU90 


102013 


102013 




Ue 170^i;9 

nS.1r040z 


102059 


102059 


Ml/ 0^000 


He 7ARAQ 


102283 


102283 


AW161559 


He M^M 


102378 


102378 


AU076887 


Hs.28491 


102460 


102460 


U48959 


H5.211582 


102499 


102499 


BE243877 


Hs.76941 


102560 


102560 


R97457 


Hs.63984 


102589 


102589 


AU076728 


Hs.8867 


102645 


102645 


AL1 19566 


Hs.6721 


102693 


102693 


AA532780 


Hs.183684 


102759 


102759 


NM_005100 


Hs.788 



EGF-TM7-iatrophifin<related protein 

ESTs 

ESTs 

SEC13(S.cerevisiaeHike1 
ESTs 

vascular cell adhesion molecule 1 
von Willebrand factor 
amelogenin (Y chromosome) 
lysosomal 

ESTs. Moderately similar to 154374 gene 
ESTs 

endo0ielin 1 
ESTs 
ESTs 
ESTs 

upstream regulatoiy element binding piDt 

Pirin 

H4 histone family, member G 
KIAA0512 gene product; AI-EX2 
ESTs, Weakly similar to 138022 hypotheti 
Sjogren syndrome antigen A2 (60kD, ribon 
tliymosin, beta, identified in neuroblast 
myosin X 

prostate differentiation factor 

hypothetical protein FU20373 

Homo sapiens cDNA: FLJ23197 fis, done R 

eukaryotic translation initiation factor 

veside^ssociated membrane protein 5 (m 

SEC14 (S. cerevisiaeHil^e 1 

heparan sulfate proteoglycan 2 (perlecan 

calumenin 

platelet/endothelial ceO adhesion molec 
KIAA0096 protein 

complement component Clq receptor 

SRY (sex detennining region Y)-box 4 

nudix (nucleoside diphosphate linked mol 

Homo sapiens mRNA; cDNA DKFZp586E1S24 (f 

thymidylate synthetase 

KIAA0015 gene product 

ribonuclease, RNase A family, 1 (pancrea 

nidogen 2 

N-myc downstream regulated 
collagen, type Vlil, alpha 1 
heat shock 70kD protein 2 
inhibitor of DNA bindir^ 1, dominant neg 
small inducible cytokine A2 (monocyte ch 
serine (or cysteine) proteinase inhibito 
myeloid cell leukemia sequence 1 (BCL2-r 
prolylcaitoxypeptidase (angiotensinase C 
actuating transcription factor 3 
four and a half LIM domains 2 
calcrtonin leceptor-like 
calpain 2, (m/ll) targe subunit 
collagen, type IV. alpha 1 
pentaxirHrelated gene, rapidly induced b 
intercellular adhesion molecule 2 
guanine nucleotide binding protein (G pr 
GR02 oncogene 

ftlamin B. beta (actin-btnding prDtein-2 
nudear factor of kappa light polypeptid 
hexokinase 1 

zinc finger protein homologous to Zfp-36 
serine (or cysteine) proteinase Inhibito 
transgelin 

Immediate early response 3 
catenki (cadherin-associated protein), a 
nicotinamide N-methyltransferase 
guanine nudeotide binding protein 11 
spermidine/spenn'ine Nl-acetyitransferase 
myosm, light polypeptide kinase 
ATPase. Na-»/K+ transporting, beta 3 poly 
cadherin 13, H^^adherin (heart) 
qfstelne-rfch. angngenic inducer, B1 
Ij^osomal 

eukaiyotic translation inifiation Cactor 
A kinase (PRKA) anchor protein ^ravin) 
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102682 102882 AI767736 Hs^90070 gelsoGn (amyloidosis. Finnish type) 

102915 102915 X07820. Hs^58 matrix metailopiDteinase 10 (stromelystn 

102980 102960 AI904738 Hs.76053 DEAD/H (Asp-Glu-Ala-Asp/His) txM polypep 

103020 103020 X53416 Hs.195464 filatron A. alpha (acKn-binding protfiln- 

S 103036 103036 M13509 Hs.83169 matrix metaDoproteinase 1 (interstitial 

103080 103080 AU077231 Hs.82932 cydin D1 (PRA01: parathyroid adenomatos 

103138 103136 X65965 gb:H^piensS0D-2 gene for manganese su 

103195 103195 AA351647 Hs2642 eulcaryotic translation elongation factor 

103371 103371 X91247 Hs.13046 thioredoxin reductase 1 

10 103471 103471 Y00615 Hs.75216 protein tyrosine phosphatase, receptor t 

104447 104447 AW204145 H8.156044 ESTs 

104783 104783 AA533513 Hs.93659 protein disuIRde isomerase related prot 

104865 104865 T79340 Hs^575 B-cefl CU/lymphoma 6, member B (zinc fi 

104894 104894 AF065214 Hs.18858 phosphdipase A2. group IVC (cytosofic. 

15 105113 105113 AB037816 Hs.8982 Homo sapiens, done ll\/IAGE:3506202, mRNA. 

105196 105196 W84893 Hs.g305 angiotensin receptor-lilce 1 

105263 105263 AW388633 Hs.6682 solute canter family 7. (cationic amino 

105330 105330 AW338625 Hs.22120 ESTs 

105492 105492 AI805717 Hs.289112 CGI-43 protein 

20 105594 105594 AB024334 Hs.25001 tyrosine 3-monooxygenase/tjyptophan 5-mo 

105732 105732 AW504170 Hs.274344 hypothetical protein MGC1 2942 

105882 105882 W46802 Hs.8198d disat>ted (Drosophila) homolog 2 (mitogen 

106031 106031 X64116 Hs.171844 Homo sapiens cDNA: FU22296 fis, done H 

106222 106222 AA356392 Hs.21321 Homo sapiens done FLB921 3 PR02474 mRNA. 

25 106263 106263 W21493 Hs.28329 hypothetical protein FU1 4005 

105366 106366 AA186715 Hs.336429 RIKEN cDNA 9130422N19 gene 

105634 106634 W25491 Hs.288909 hypothetical protein FU22471 

105793 106793 H94997 Hs.16450 ESTs 

105842 106842 AF124251 Hs.26054 novel SH2-containing protein 3 

30 106890 106890 AA489245 Hs.88500 mitogen^ctivated protein Kinase 8 Inter 

106974 106974 AI817130 Hs.9195 Homo sapiens cDNAFU13698fis, done PL 

107061 107061 BE147611 Hs.6354 stromal ceD derived factor receptor 1 

107216 107216 D51069 H&211579 melanoma cell adhesion molecule 

107444 107444 W28391 Hs.343258 profiferation-assodated 2G4, 38kD 

35 108507 108507 AI554545 Hs.68301 ESTs 

108931 108931 AA147186 gb2o38d01.s1 Stratagene endothelial cel 

109195 109195 AF047033 Hs.132904 solute carnerfemlly 4, sodium bicaibon 

109456 109456 AW956580 Hs.42699 ESTs 

110411 110411 AW001579 Hs.9645 Homo sapiens mRNA for K1AA1741 protein. 

40 110906 110906 AA035211 Hs.17404 ESTs 

111091 111091 AA300067 Hs.33032 hypothetical proteb DKF2p434N 185 

111378 111378 AW160993 Hs.326292 hypothetical gene DKFZp434A1 114 

111769 111769 AW629414 Hs.24230 ESTs 

112951 112951 AA307634 Hs.6650 vacuolar protein sorting 45B (yeast homo 

45 113195 113195 H83265 Hs.8881 ESTs. Weakly simiter to S41044 chromosom 

113542 113542 H43374 Hs7890 Homo sapiens mRNA for K1AA1 671 protein. 

113847 113847 Nt^005032 Hs.4114 plastin 3 (T isofonn) 

113947 113947 W84768 gb:zh53d03.s1 Soaresjetaljiver.spleen, 

115061 115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDN 

50 '115870 115870 NM_QQ5985 Hs.48029 snail 1 (drosophila homolog). zinc finge 

116228 116228 A1767947 Hs.50841 ESTs 

116314 116314 AI799104 Hs.178705 Homo sapiens cDNA FU1 1333 fis, done PL 

117023 117023 AW070211 Hs.102415 Homo sapiens mRNA; cDNA DKF2p586N0121 (f 
117156 117156 W73853 ESTs 

55 117280 117280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 (is. done C 

119866 119866 AA496205 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (f 

121314 121314 W07343 Hs.182538 phosphoGp{dscramblase4 

121822 121822 AI743860 metallothtonein IE (functfonal) 

122331 122331 AL133437 Hs.110771 Homo sapiens cDNA:FU21904 fis. done H 

60 123160 123160 AA488687 Hs.284235 ESTs. Weakly slmDar to 138022 hypothetl 
124059 124059 BE387335 Hs.263713 ESTs. Weakly similar to S64054 hypothetl 
124358 124356 AW070211 Hs.102415 Homo sapiens mRNA; cDNADKFZp586N01 21 (f 
124726 124726 NM_003654 Hs.104576 carbohydrate (keratan sulfate Gal^) sul 
125167 125107 AL137540 Hs.102541 netrin4 

65 125307 125307 AW580945 Hs.330466 ESTs 

107985 107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKF2p564F053 (!r 
125598 125598 T40064 Hs.71968 Homo sapiens mRI^; cDNA DKF2p564F053 (fr 
413731 413731 BE243845 Hs.75511 connective tissue growth factor 
116024 116024 AA088767 Hs.83883 transmembrane, prostate androgen induced 

70 418000 418000 AA932794 Hs.83147 guanine nucleotide binding prolein-IIke 

126399 126399 AA088767 Hs.83883 transmembrane, prostate androgen induced 

127568 127566 AI051390 Hs.1 16731 ESTs 

128453 128453 X02761 Hs.287820 fibronedinl 

128515 128515 BE395085 Hs.10086 type 1 transmembrane protein Fn14 

75 128623 128623 BE076608 Hs.105509 CTI^gene 

128669 128689 W2B493 Hs.180414 heat shock 70kD protein 8 
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u _ AAA "nn 
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Hs.75511 
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Hs.17132 
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AI077288 


Hs.296323 


131080 
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NM_001955 
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131091 


131091 


AJ271216 
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131182 


131182 


A 1 A A A A A A 

A1824144 


Hs.23912 


131319 


131319 


NM_003155 


1 t APfAA 

Hs,25590 


131328 


131328 


AW939251 


Hs.25647 


131328 


A Oil f>f> 

131328 


AW939251 


Hs.25547 


131555 


131555 


T47364 


Hs.278613 


131573 


131573 


AA04031 1 


ns.28959 


131756 


131756 


AA443966 


Hs.31595 


131909 


131909 


N^/L016558 


11^ A^ A A A A 

Hs.274411 


132046 


132046 


AI359214 


Hs.1 79260 


132151 


132151 


BE379499 


Hs.173705 


132187 


132187 


AA235709 


Hs.4193 


132314 


132314 


AF1 12222 


Hs.323806 


132398 


132398 


AA876616 


Hs.16979 


132490 


132490 


NM_001290 


Hs.4980 


132546 


132546 


M24283 


Hs.168383 


132716 


132716 


BE379595 


Hs^738 


132883 


132883 


AA373314 


Hs.5897 


A A A AAA 

132969 


132989 


AA480074 


Hs.331328 


133071 


133071 


BE384932 


Hs.64313 


133099 


133099 


W16518 


1 ft A*tAP^A 

Hs^9518 


133149 


133149 


AA370045 


Hs.6607 


133200 


133200 


AB037715 


Hs.1 83639 


133260 


133260 


AA403045 


Hs.6906 


133349 


133349 


AW631255 


Hs.8110 


133398 


133398 


NM_000499 


Hs.72912 


133454 


133454 


BE547647 


Hs.177781 


133491 


133491 


BE619053 


Hs.170001 


133517 


133517 


NM_000165 


Hs.74471 


133538 


133538 


NM_003257 


Hs.74614 


133584 


133584 


D90209 


Hs.1 31 243 


133617 


133617 


BE244334 


Hs.75249 


133671 


133671 


AW503116 


Hs.301819 


133681 


133681 


AI352558 




133730 


133730 


BE242779 


Hs.179526 


133802 


133802 


AVV239400 


Hs.76297 


133838 


133838 


BE222494 


Hs.180919 


133889 


133889 


U489S9 


Hs.211582 


133975 


133975 


C18356 


Hs^95944 


134039 


134039 


NM_002290 


Hs.78672 


134081 


134081 


AL034349 


Hs.79005 


134203 


134203 


AA161219 


H5.799 


134299 


134299 


AW580939 


Hs.97199 


134339 


134339 


R70429 


Hs.81988 


134381 


134381 


AI557280 


Hs.1 84270 


134416 


134416 


X68264 


Hs.211579 


134558 


134558 


NM_001773 


Hs.85289 


134983 


134983 


r<HAA AAr 

D28235 


Hs.1 96384 


135052 


135052 


Al «iOP^fA 

AL1 36653 


Hs.93675 


135069 


135069 


AA876372 


Hs.93961 


135073 


135073 


W55956 


Hs.94030 


135196 


135196 




ns.90l5 


134404 


13M04 


AB000450 


Hs.82771 


100082 


100082 


AA130080 


Hs.4295 


130150 


130150 


BE094848 


Hs.15113 


130839 


130839 


AB011169 


Hs^0141 


100113 


100113 


NM.001269 


Hs.84746 


100129 


100129 


AA4G9369 


Hs^l 


100169 


100169 


AL037228 




100190 100190 


M91401 


HS.17865B 



plasmalemnia veslde associated protein 
vascular cell adhesion molecule 1 
dual spedficity phosphatase 1 
secreted protein^ acidic, cysteine-rich 
connective tissue growth factor 
tetraspan NET-6 protein 
tripartite motif protein TRIM2 
metallothionelnlL 

insulin-lil(e growth factor-binding prote 
cainexin 

pituitary tuntor*tfansfomiing 1 

ESTs, l\toderateiy similarlo i54374 gene 

Homo sapiens cONA RJ10934 fis, done OV 

hypolheticsd protein SP329 

serum/gtucocorticoid reguiated Idnase 

endothelin 1 

dipeptidylpeptidase Hi 

ESTs 

stannlocaldn 1 

v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
interferon, alpha-indudble protein 27 
ESTs 
ESTs 

SCAN domain-containing 1 

chromosome 14 open reading frame 4 

Homo sapiens cDNA: FU22050 fis, done H 

DKFZP58601624 protein 

pinin, desmosome assodated protein 

ESTs. Weakly simitar to A43932 mudn 2 p 

LIM domain binding 2 

interceliidar adhesion molecule 1 (CD54) 

casein idnase 1, alpha 1 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 

hypothetical protein FU13213 

ESTs. Weakly similar to AF2S7182 1 G-pro 

amyloid beta (A4) precursor-iiice protein 

AX1N1 up*regulatfid 

hypothetical protein FU 10210 

Homo sapiens cDNA: FU23197 fis, done R 

L-3-hydroxyacyI-Coenzyme A dehydrogenase 

cytochrome P450. subfamily I (aromatic c 

hypothetical protein MGC5618 

eukaryotic translation inHiatbn fador 

gap Junction protein, alpha 1. 43kD (con 

tight junction protein 1 (zona ocdudens 

activating transcription factor 4 (tax-r 

ADP-ribosyIatk)n factor-like 6 interacti 

zinc finger protein 146 

tyrosine 3-monooxygenase/tryptophan 5-mo 
upregulated by 1 ,25-dihydroxyvitamin D-3 
G protein-coupled receptor kinase 6 
inhibitor of DNA binding 2, dominant neg 
myosin, light polypeptide kinase 
tissue factor pathway inhibitor 2 
laminin, alpha 4 

protein ^rosine phosphatase, receptor t 
diphthers toxin receptor (heparin-bind} 
complement component Clq receptor 
disabled (Drosophila) homotog 2 (mitogen 
capping protein (actin filament) musde 
melanoma cell adhesion molecule 
CD34 ant^en 

prostaglandin-endoperoxide synthase 2 (p 

deddual protein induced by progesterone 

Homo sapiens mRNA; cDNA DKFZp667D095 (fr 

Homo sapiens mRNA; cONA DKFZp586E1624 (f 

myosin regulatory light chasi 2. smooth 

vacdnia related kinase 2 

proteasome (prosome. macropain) 26S subu 

homogentisate 1.2-dioxygenase (homogenti 

similar to S. cerevisiae SShW 

chromosome condensation 1 

tissue inhibitor of metalloprote^iase 1 

D123 gene product 

RAD23 (S. cere>nsiae) homotog B 
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100211 100211 D^28 Hs.123058 DEAD/H (Asp^tu-Ala^p/His) box polypep 

130283 130283 Nr4.012288 Hs.153954 TRAI\Mce protein 

100248 100248 NM.015156 Hs.78398 K1AA0071 protein 

100262 100262 D38500 Hs.278468 postmeiotic segregation increased 2*like 

S 100281 100281 AF091035 Hs.184627 KIAA01 18 protein 

100327 100327 D55640 gb:Human monocyte PABL (pseudoautosomal 

134495 134495 D63477 H5.84D87 KiAAOl 43 protein 

135152 1 35152 M98954 Hs.182741 TIA1 cytotoxic granule^sodated RI4A-bi 

100372 100372 NM_014791 Hs.184339 KIAA01 75 gene product 

10 100394 100394 DB4284 Hs.66052 CD38 antigen (p45) 

100418 100418 D86978 Hs.847gD KIAA0225 protein 

134347 134347 AF164142 Hs.82042 solute carrier famQy 23 (nudeobasetra 

100438 100438 AA013051 Hs.91417 topoisomerase (DMA) II binding protein 

100481 100481 X70377 HsJ21489 cystatinD 

15 100591 100591 Nl\4_004091 Hs.231444 Homo sapiens, Similar to hypothetical pr 

100662 100662 A1368680 Hs.816 SRY (sex detennining region Y)-box 2 

100905 100905 L12260 Hs.172B16 neureguDn 1 

100950 100950 AF128542 Hs.166846 polymerase (DNA directed), epsilon 

135407 135407 J04029 i^s.99936 keratin 10 (epidermotytic hyperkeratosis 

20 131877 131877 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 

134786 134786 729618 Hs.89640 TEK tyrosine kinase, endothelial (venous 

134078 134078 L08895 HsJ8995 MADS box transcnptlon enhancer factor 2 

134849 1 34849 BE409525 Hs.902 neuroRbromin 2 (bi^teral acoustic neur 

101152 101152 AI984625 Hs.9884 spindle pole body protein 

25 131687 131687 BE297635 Hs.3069 heat shock 70kD protein 9B(mortalin-2) 

421155 421155 H87879 Hs.102267 lysyl oxidase 

133975 133975 C18356 Hs.295944 tissue factor pathway inhibitor 2 

130155 130155 AA101043 Hs.151254 kalfikrein 7 (chymotiyptic, stratum com 

132813 132813 BE313625 Hs.57435 solute carrier family 11 (proton-coupled 

30 101300 101300 BE535511 transmembrane trafficking protein 

130344 130344 AW250122 Hs.154879 DiGeorge syndrome crHkal region gene D 

101381 101381 AW675039 Hs.1227 aminolevuilnate. delta-, dehydratase 

133780 133780 AA5576G0 Hs.76152 deoorin 

101447 101447 M21305 gb:Human alpha satellite and satellite 3 

35 101470 101470 NWL000546 Hs.1846 tumor protein p53 (U-Fraumeni syndrome) 

101478 101478 NIV)_002890 Hs.758 RAS p21 protein activator (GTPase activa 

133519 133519 AW583062 Hs.74502 chymotrypsinogen B1 

134116 134116 RB4694 Hs.79194 cAMP responsive element binding protein 

130174 130174 M29551 Hs.151531 protein phosphatase 3 (fbnnerly 28). cat 

40 132983 132983 M30269 nidogen (enacUn) 

101543 101543 M31166 Hs^050 pentaxin-related gene, rapidly induced b 
101620 101620 S55271 Hs.247930 Epsiton.lgE 
133595 133595 AA393273 Hs.75133 transcription factor 6-like 1 (mitochond 
101700 101700 D90337 Hs.247916 natriuretic peptide precursor C 

45 134246 134246 D28459 Hs.80612 ubiquitin-conjugating enzyme E2A (RAD6 h 
133948 133948 X59960 Hs.77813 sphingomyelin phosphodiesterase 1, add 
133948 133948 X59960 Hs.77813 sphingomyelin phosphodiesterase 1, add 
133948 133948 X59960 Hs.77813 sphingomyelin phosphodiesterase 1. add 
101812 101812 BE439894 Hs.78991 DNA segment, numerous copies, expressed 

50 133396 133396 M96326 Hs.72885 azuroddin 1 (cationic antimicrobial pro 
129026 129026 AL120297 Hs.108043 Friend leukemia vims integratton 1 
134831 134831 AA85347g Hs.89890 pynivate carboxylase 
134395 134395 AA456S39 Hs.8262 lysosomal 
101977 101977 AP1 12213 Hs.184062 putative Rab5Hntefacting protein 

55 101998 101998 U01212 Hs.248153 olfectory martcer protein 

102007 102007 U025S6 Hs.75307 t-<X}mplex-associated-testis-expressed 1- 
416658 416658 U03272 Hs.79432 fibrillin 2 (congenital contractural ara 
135389 135389 U05237 Hs.99872 fetal Alzheimer ant^en 
130145 130145 U34820 Hs.151051 mitogen^ctivated^ protein kinase 10 

60 420269 420269 U72937 Hs.96264 alpha thatassemia/mental retardation syn 
102123 102123 NM_001809 Hs.1594 centromere protein A (17kD) 
102133 102133 AU076845 Hs.155596 BCl-2/adenovims E1B 19kD-interacting pro 
102162 102162 AA450274 Hs.1592 . CDC16 (cell division cyde 16, S.cerevi 
427653 427653 AA159001 Hs.180O69 nudear respiratory factor 1 

65 102200 102200 AA232362 Hs.157205 branched chain aminotransferase l.cytos 
102214 102214 U23752 Hs.32964 SRY(sexdetenminingregtonY)-bax11 
131319 131319 NN/L003155 Hs.25590 stanniocakan 1 
132316 132316 U28831 Hs.44566 KIAA1 641 protein 
134365 134365 AA568906 Hs.82240 syntaxin3A 

70 102298 102298 AA382169 Hs.54483 N-myc (and STAT) interactor 

302344 302344 BE303044 Hs.192023 eukaryotk: translation Initation factor 
102367 102367 U39656 Hs.1 18825 mitogen^ctivated protein kinase kinase 
102394 102394 NIVL003816 Hs.2442 a disintegrin and metaltoproteinase dome 
129521 129521 AF071076 Hs.112255 nudeoporin 9BkD 

75 102251 102251 NM_004398 Hs.41706 DEAD/H (Asp^h^Ala-Asp/Hte) box polypep 

133746 133746 - AW410035 HS.75B62 MAD(momeisagaBnstdecapenteplegicDr 
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134662 


134662 


NM_007048 


Hs.284283 


103500 


103500 


AW408009 


Hs.22580 


132084 


132084 


NM_002267 


Hs.3886 


133152 


133152 


211695 


1 1^ 0\^A A"V^ 

Hs.324473 


103612 


103612 


BE336654 


Hs.70937 


103692 


103692 


AW137912 


Hs.227583 


129796 


129795 


BE218319 


Hs.5807 


132683 


132683 


BE264633 


1 1^ 4 4A/«4A 

Hs. 143638 


103723 


103723 


BE274312 


LJ_ A4 A'TOA 

Hs.214783 


133260 


133260 


A A A/\on4e 

AA403045 


Hs.6906 


103766 


103766 


Ai920783 


LIm 4 A4 A*iC 

ns.191435 


132051 


132051 


A A <SA4A^B 

AA3939o8 


ns.180145 


135289 


135289 


AW372569 


Hs.9788 


103794 


103794 


A en A A 4 

AF244135 


AAA^A 

Hs.30670 


134319 134319 


BE304999 


Hs.285754 


119159 


119159 


A PA AAA A A 

AF142419 


Hs.15020 


103850 


103850 


AA187101 


Hs213194 


322026 322026 


AW024973 


Hs.283675 


103861 


103881 


AA206236 


Hs.4944 


447735 


447735 


AA775268 


Hs.6127 


131236 


131236 


AF0431 17 


Hs.24594 


129013 


129013 


A A 074 4 CC 

AA371150 


1 l_ 4A7A4A 

Hs, 107942 


103988 


103988 


A A 44 A 4Dn 

AA314389 


Hs.342849 


425284 


425284 


A Cr4 CCC£fO 


Hs.348043 


133281 


133281 


A I^AA4 CA4 


Hs.69594 


108154 


108154 


NlvL0Uo7o4 


Hs.220d89 


135073 


135073 


W55956 


t 1^ AAAAA 

Hs.94030 


129593 


129593 


AI33o247 


LJ^ AAA4 4 

Hs.93314 


132084 


132064 




Lie IfiQfi 


131427 


131427 


AF151879 


Hs.26706 


104282 


104282 


C14448 


Hs.332338 


130443 


130443 


0^216 


Hs.155650 


132837 


132837 


AA370362 


Hs.57958 


104334 


104334 


D82614 


HS7B771 


134731 


134731 


D89377 


Hs.89404 


131670 131670 


H03S14 


Hs.15589 


104402 104402 


H56731 


Hs.132956 



fibroblast growth factor 8 (androgervind 
fibroblast growth factor 8 (androgervind 
protein kinase. DNA-adtvated. catalytic 
Human 6RCA2 region, mRNA sequence CG006 
cullin4B 

cadherin 13, H-cadherin (heart) 
ubiquitin-like I (sentrin) 
Homo sapiens cDNA: FLJ23037 fis, done L 
karyopherin (iniportin) beta 2 
protein phosphatase 4, regulatory subuni 
melanoma ant^n. feuriBy A, 2 
procollagen-lysine, 2-oxoglutarate 5-dto 
cytDsolte acyl coenzyme A thtoester hydr 
thyroid honnone receptor interactor 13 
IGF-II mRNA-binding protein 3 
2,3-^isphosphogtycerate mutase 
apoiipoprotein A>l 
thrombospondin 1 

methylene tetrahydrofolate dehydrogenase 
Integrin, alpha 6 

multifunctional polypeptide similar to S 
5' nucleotidase (CD73) 
adenylate kinase 3 

dipeptidylpepiidase IV (CD26, adenosine 
Rhesus blood group. D antigen 
genera! transcnption factor IIP. polype 
fms-related tyrosine kinase 4 
retinoblastoma-binding protein 7 
BMX non-receptor tyrosine kinase 
cyclin-dependent kinase 8 
transient receptor potential channel 1 
uracii-DisiA glyoosylase 
endothelial ceH-spedfic molecule 1 
purine-ridi element binding protein A 
HCGVIIM protein 

DRI-assoclated protein 1 (negative cofac 

M>phase phosphoprotein 6 

SMT3 (suppressor of mif two 3. yeast) ho 

Ric (Drosophila)-1ike, expressed in many 

butyrophiTin, subfamily 3, member AI 

atkylgiycerone phosphate synthase 

karyopherin alpha 3 (importin alpha 4) 

mitogen-activated protein kinase 1 

H3 histone family, member A 

Homo sapiens chromosome X map Xpll.23 L- 

GTPase Rab14 

WD repeat domain 4 

Homo sapiens cDNA FLJ14041 fis, done HE 
Homo sapiens cDNA: FU23197 fis, done R 
ESTs 

HSPC030 protein 

hypoffietlcal protein MGC10924 similar to 
hepatocellular carcnioma-associated anfi 
fumarate hydratase 

homotog of mouse quaking QKi (KH domain 

hypothetical protein MGC10895 

NPD009 protein 

hypothetical protein FLJ12783 

Homo sapiens cDNA: FLI23020 fis, done L 

ubiquitination fadorE4B (homok)gous to 

DKFZP564M112 protein 

ADP-ribosylation factor-like 5 

NSI-assodated protein 1 

high-mobility group 20A 

Ras-GTPase-activating protein SH3-domain 

Homo sapiens mRl^; cDNA DKFZp586E1624 (f 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 

serum-tnducible kinase 

CGI-121 protein 

EST 

KiAA0014 gene produd 

EGF-TM7-!atrophilin-reIated protein 

phosphogtycerate kinase 1 

msh (DrosophOa) homeo box homotog 2 

ESTs 

ESTs 
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129077 129077 N74724 Hs.108479 ESTs 

134927 134927 L36531 Hs.91296 Integrin, alpha 8 

134498 134498 AW246273 Hs.84131 threonyl-tRNA synthetase 

104488 104488 N56191 Hs.108511 protocadherin 17 

5 129214 129214 AL044335 Hs.109526 zinc finger protein 198 

104530 104530 AK001676 Hs.12457 hypothetical protein FU10814 

104544 104544 AI091173 Hs.222382 ESTs. Weakly similar to p40[H.sapiens] 

104567 104567 AA040620 Hs.5672 hypothetical protein AF140225 

129575 129575 F08282 Hs278428 progestin induced protein 

10 104599 104599 AW815036 Hs.151251 ESTs 

104667 104667 AI239923 Hs.63931 ESTs 

104764 104764 At039243 Hs.278585 ESTs 

104787 104787 AA027317 gb:ze97d11^1 SoaresJetaLheait.NbHH19W 

104804 104804 AI858702 Hs.31803 ESTs. Wealdy similar to N-WASP[H^apien 

IS 130828 130828 AW631469 Hs.203213 ESTs 

104943 104943 AF072873 Hs.1 14218 frizzled (DrosophOa) homolog 6 

105024 105024 AA126311 Hs.9879 ESTs 

105038 105038 AW503733 Hs.9414 K1AA1 488 protein 

105096 105096 AIX)42506 Hs^1599 Kwppel-like factor 7 (ubiquitous) 

20 105169 105169 BE245294 Hs.180789 S164 protein 

130401 130401 BE396283 Hs.173987 eukaryotic translation initiation factor 

130114 130114 AA233393 Hs,14992 hypothetical protein FU1 11 61 

105337 105337 AI468789 Hs.347187 myotubularin related protein 1 

105376 105376 AW994032 Hs.8768 hypothetical protein FU 10849 

25 131962 131962 AK000046 Hs.343877 hypothetical protein FU20039 

128658 128658 6E397354 Hs.324830 diptheria toxin resistance protein requi 

105508 105508 AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564Hl 916 (f 

135172 135172 AB028956 Hs.12144 KIAA1033 protein 

132542 132542 AL137751 Hs^63671 Homo sapiens mRNA; cDNA DKFZp434I0812 (f 

30 105559 105659 AA283044 Hs.25625 hypothetical protein FU1 1323 

105674 105674 AI609530 Hs^79789 histone deacetylase 3 

105722 105722 AI922821 Hs.32433 ESTs 

115951 115951 BE546245 Hs.301048 seel 3-Jilce protein 

105985 105985 AA406610 gb:ztf1 5b1 0^1 SoaresJJhHMPu^SI Homo sapi 

35 131216 131216 A1815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis. done HE 

113689 113689 AB037850 Hs.16621 DKFZP434I1 16 protein 

130839 130839 AB011169 Hs.20141 simOartoS.cerevisiaeSSAM 

130777 130777 AW135049 Hs.26285 Homo sapiens cDNA HJ10643 fis. clone NT 

108198 106196 AA525993 Hs.173699 ESTs. Wealdy similar to ALU1_HUMAN ALUS 

40 133200 133200 AB037715 Hs.183639 hypothetical protein FU1 0210 

106328 106328 A1J079559 Hs^8020 K1AA0766 gene product 
106423 106423 AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 
439608 439608 AW864696 Hs.301732 hypothetical protein MGC5306 
106503 106503 AB033042 Hs.29679 oofactor required for Spl transcriptiona 

45 106543 106543 AA676939 Hs.69285 neuropiUn 1 

106589 108589 AK000933 Hs.28661 Homo sapiens cONA FU1 0071 fis. done HE 
106596 106596 AA452379 ESTs. Moderately similar to ALU7_HUMAN A 

106636 106636 AW958037 Hs.286 ribosomai protein L4 
131353 131353 AW754182 gb:RC2-CT0321-131199^11-o01 CT0321 Homo 

50 131710 131710 NM_015368 Hs.30985 pannexin 1 

131775 131775 AB014548 Hs.31921 KIAA0648 protein 
106773 106773 AA478109 Hs.188833 ESTs 
106817 106817 D61216 Hs.18672 ESTs 

106848 106848 AA449014 Hs.l21025 chromosome 11 open reading frame 5 
55 418699 418699 BE539639 Hs,173030 ESTs. WealdysimBar to ALU8.HUMAN ALUS 
130538 130638 AW021276 Hs.17121 ESTs 

107059 107059 BE614410 Hs.23044 RAD51 (S. cerevisiae) homolog (E coli Re 
107115 107115 BE379623 Hs^693 peptidylprolyl isomerase (cyclophQin)-! 
107156 107156 AA137043 Hs.9663 programmed cell death &4nteractingprot 

60 130621 130621 AW513087 Hs.16803 LUC7 (S. oerevlsiaeHike 

132626 132626 AW504732 Hs^1275 hypofiiefical protein FU11 Oil 
131610 131610 AA357879 Hs.29423 scavenger receptorwithC-type lectin 
107295 107295 AA186629 Hs.80120 UDP-N-aoetyl-alpha-D^aladosamine:polyp 
107315 107315 AA316241 HsJ0691 nudeophosmin/nudeoplasmln 3 

65 107328 107328 AW959891 Hs76591 K1AA0887 proteb 
134715 134715 U48263 Hs.89040 prepronodceptin 
129938 129938 AW003668 Hs.135587 Human done 23629 mRNA sequence 
130074 130074 AL038596 Hs250745 polymerase (RNA) III (DNA directed) (62k 
132036 132036 AL157433 Hs.37706 hypothetical protein DKF2p434E2220 

70 113857 113857 AW243158 Hs.5297 DKFZP564A2416 protein 
130419 130419 AF037448 Hs.155489 NSI-assodated protein 1 
132616 132616 BE262677 Hs^83558 hypothetical protein PR01 855 
132358 132358 NM_003542 Hs.46423 H4 histone family, member G 
125827 125827 NM_003403 Hs.97496 YY1 transcription factor 

75 107609 107609 R75654 Hs.164797 hypothetical protein FLJ13693 
107714 107714 AA015761 Hs.60642 ESTs 
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10 



15 



20 



25 



30 



35 



40 



107B32 
124337 
129577 
132000 
107935 
131461 
108029 
108084 
108168 
108189 
108203 
108217 
108277 
108309 
108340 
108427 
108439 
108469 
108501 
10B562 
130890 
130385 
108807 
108833 
108846 
131474 
108941 
108996 
131183 
109022 
109068 
129021 
109146 
131080 
109222 
109481 
109516 
109556 
109578 
109625 
109648 
109699 
109933 
110039 



107832 
124337 
129577 
132000 
107935 
131461 
108029 
108084 
108168 
108189 
108203 
108217 
108277 
108309 
108340 
108427 
108439 
108469 
108501 
108562 
130890 
130385 
108807 
108833 
108846 
131474 
108941 
108996 
131183 
109022 
109068 
129021 
109146 
131080 
109222 
109481 
109516 
109556 
109578 
109625 
109648 
109699 
109933 
110039 



AA021473 

N23541 

N75346 

AW247017 

AA029428 

AA992841 

AA040740 

AA058944 

AI453137 

AW376061 

AW847814 

AA058686 

AA064859 

AA069818 

AA069820 

AA076382 

AA078986 

AA079487 

AA083256 

AA100796 

AI907537 

AW067800 

A1652236 

AF1 88527 

AL1 17452 

L46353 

AA148650 

AW995610 

AI611807 

AA157291 

AA164293 

AU)44675 

AA176589 

NM_001955 

AA192833 

AA878923 

AI471639 

AI925294 

F02208 

H29490 

H17800 

H18013 

R52417 

H1 1938 



Hs.281561 

Hs.306121 

Hs.36978 

Hs.61555 

Hs.27263 

Hs,62007 

Hs.1 16602 

Hs.63176 

Hs.63335 

Hs.289005 

Hs.62588 



Hs.180909 



Hs.76698 

Hs.155223 

Hs.49376 

Hs.61661 

Hs.44155 

Hs.2726 

Hs.332436 

Hs.285107 

Hs.21479 

Hs.72545 

Hs.173081 

Hs.142078 

Hs.2271 

Hs.333512 

Hs.289069 

Hs.71913 

Hs.87385 

Hs.27214 

Hs.22697 

Hs.7154 

Hs.167483 

Hs.20945 

Hs.21907 



gb2e66c11.s1 Soares retina N2t)4HR Homo 
Homo sapiens cDNA: FLJ23582 fis, done L 
CDC20 (cell division cyde 20. S. cerevl 
melanoma antigen, family A, 3 
ESTs 

KtAA1458 protein 
ESTs 

Homo sapiens, done MGE:41 54008. mRNA, 
ESTs 

ESTs. Moderately similar to A46010 X-fin 
Homo sapiens cDNA: FU21532 fis, doneC 
ESTs 

gb:zm50f03^1 Stratagene fibroblast (937 
gb:znn67e03.r1 Stratagene neuroepHhelium 
peroxlredoxin 1 

gb:zrn81g08^1 Stratagene ovarian cancer 

gb:zm92h01^1 Stratagene ovarian cancer 

gb3m97fD8.s1 Stratagene colon HT29 (937 

gb:zn08g12.s1 Stratagene hNT neuron (937 

gb*:zm26o06^1 Stratagene pancreas (93720 

stress-associated endoplasmic reticulum 

stanniocaldn 2 

hypothetical protein FLJ20644 

ESTs. Weakly sinuiar to AF174605 1 F-box 

DKFZP586G1517 protein 

high>mobility group (nonhistone chromoso 

gb2o09e06.s1 Stratagene neuroepithelium 

EST 

hypothetical protein FU13397 

ubinuciein 1 

ESTs 

KIAA0530( 
EST 
endothePn 1 

similar to rat myomegaOn 

hypothetical protein FU21016 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequerwe 
histone acetyltransferase 
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TABLE 2 A 

Table 2A shouus the accession numbers for those pkeys lacking unigeneID*s for Tab!e 2. The pkeys in Table 7 lacldng untgeneID*s are represented within 
Tables 1-6A. For each probeset we have fisted the gene cluster nuniber from which the oligonudeolides were design^^ Gene dusters were comp9ed 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
AUgnment Tools (DouUeTwisl Oakland California). The Genbank accession numbers for sequences comprising each duster are listed In the "Acoesdon" 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank aocessksn numbers 



Pkey 



CAT Number Accession 



108469 
108501 
108562 
101300 



125565 
132983 



121335 
130018 
121822 



108309 
107832 
123523 
123964 
118475 
104787 
106598 
113947 
108277 



116761J 
13684^-12 
36375^1 
4669J 



117156 145392_1 



1704098J 
11923.1 



133681 13893.1 



AA079487 AA126547 AA126291 AA079587 AA079600 
AA083256 

AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 
BE53551 1 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 
AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA435722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683475 AI140901 AI823576 AW519064 AW474125 A1953923 AI735349 AW150109 
AI436154 AW118130 AW270782 AI804073 N27434 AA876543 AA937815 AI051 166 AA505378 AI041975 AI335355 
AI089540 AA662243 AI127912 At925604 AI250880 AI366874 AI564386 AI815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA448691 A1673767 A1076042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AAg07805 AA91 1493 AI58941 1 A1371358 AW576236 A1078866 AW516168 AA346372 AI560185 AA471009 R75857 
AA298025 AA523155 AA853168 A1696593 At658482 AI566601 AW072797 AA128047 AA035502 AW243274 AA9g2517 
R43760 

W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 A1253102 AI079403 AI370541 AI697341 
H97538 AW188021 Ai927669 W72716 AI051402AI18e071 AI335900 N21488 AW770478 W92522 AI691028A1913512 
AI144448 W73819 AA604358 N28900 W95221 A1868132 H98465 AA148793 
R20840 R20839 

IVI30269 NM_002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 226985 AW175704 AA343326 AW747957 AI818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21232 AA190565 AW379755 AW067895 

AI352558 Z82248 X78138 NM 003405 AU077248 AA223125 S80794 D78577 A1124697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538B08 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082. 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134358 AA322911 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA808688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 AI922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 AI635674 AW131665 A1473820 AW273118 AW900930 AA908944 AI6B8035 AW170272 
A1082545 AW468176 A1608761 A1082748 AI911682 A1248943 AI831016 AA192465 A1218477 AA938406 AA385288 
AI809817 AA905196 AI191245 A1470204 AI188296 A1421367 AI125315 AI087141 AA629032 AA740589 AI554181 
AA150830 A1248541 AI077943 AA775958 AA864930 AI261476 AI123121 A1310394 AA862331 AA872478 BE537084 
A1205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 Ai359820 AA1 50478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA5659S5 N20527 Af279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D62630 W65437 
AI086917 AW382095 A1086877 H69&44 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI81 1102 AI81 1 1 16 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 A1864305 AI423963 AW084401 F04963 R69858 H67097 Ai917740 Ai655561 H69864 
AA033631 AW383484 A18e6261 H25293 AA513281 AW271187 H1 1617 N79982 AI174338 AI904207 A1904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 
AA404418AI217248 

AA353093 AW957317 AW872498 AI560785 A1289110 AW135512 X97261 T68873 

AI743860 N49543 AW027759 BE349467 A1656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI611611 AI611407 AI611452 W56437 A1284566 
AI583349 AW183058 AI308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 A1964994 Ai539335 AA401550 AA358757 A1609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 AI038530 AA425107 A1014986 AI148349 AW237721 AW779756 
AW137877 AI125293 AA4004M R28554 
AA069818 AA069971 AA069923 AA0699Q8 
genbank^AA021473 AA021473 
genbankj^608588 AA608588 
genbanleC13961 C13961 
genbank^l^5845 N66845 
genbanK.AA027317 AA027317 

304084 1 AI583948AA578212AW303715AA653450AA456981At400385 W88533AI224133AW272145AA088686R94^ 

genbaniLw84768W84768 

genbanKJVA064859 AA0648S9 



279548.1 
18986J 
244391J 



111495.1 
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10 



108427 
108439 
131353 

101447 
108931 
108941 
103136 
119174 
119416 
105985 
100327 



genbank_AA076382 AA076382 
9enbanK_AA078986 AA078986 

231290J AW411259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 AI027260 R52686 AA918278 AI129462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 
entrez_M21305 M21305 

AA147186 
AA148650 



genbanleAA147186 
genbank^148650 
enlrez_X65965 X65965 
genbanieR71234 R71234 
genbank.T97186 T97186 
genbanK_AA406610 
entmiL.D55640 D55640 



AA406610 
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TABLE 3: 



Pkey: Unique Eos probeset identiHer number 

Accession: Accession number used for previous patent firings 

ExAccn: Exemplar Accession number, Genbank accession number 

Un^enelD: Unigene number 

Un^ene Title: Unigene gene title 



Pkey Accession ExAccn UnlGene UnigeneTitie 



100405 D86425 
100420 D86983 
100481 H610984fri098 
100484 HG1103W1103 
100718 HG3342-HT3519 
100991 J03764 
101097 L06797 
101168 L15388 
101194 L20971 
101261 L35545 
101345 L76380 
101447 M21305 
101485 M24736 
101543 M31166 
101550 M31551 
101560 M32334 
101674 IV161916 
101714 M68874 
101741 IV174719 
101838 M92934 
101857 M94856 
102012 U03057 
102024 U03877 
102164 U1B300 
102241 U27109 
102283 U31384 
102303 U33053 
102564 U59423 
102663 U70322 
102759 U81607 
102778 U83463 
102804 U89942 
102887 X04729 
102898 X06256 
102915 X07820 

103036 X54925 

103037 X54936 
103095 X60957 
103158 X67235 
103166 X67951 
103185 X69910 
103280 X79981 
103554 218951 
103850 AA187101 
104465 N24990 
104592 R81003 
104764 AA025351 
104786 AA027168 
104850 AA040465 
104865 AA045136 
104894 AA054087 
104952 AA071089 
104974 AA085918 
105178 AA1 87490 
105263 AA227926 
105330 AA234743 
105376 AA236559 
105729 AA292694 
105826 AA398243 
105977 AA406363 
105008 AA411465 
106031 AA412284 
108124 AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NIVL005402HS.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM.005308Hs,211569 
1^71 Hs.188 
D30857 Hs.82353 
NWL005795HS.152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs75716 
AW958272 Hs.347326 
NNL002291HS.82124 
M68874 Hs^11587 
NM.003199HS.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
BE259035 Hs.118400 
AA301867 Hs.76224 
NM-000107HS.77602 
NM-.007351Hs^68107 
AW161552 Hs.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NWL002270HS.168075 
NM 005100HS.788 
AF0'b0652 Hs.8180 
NI\A-002318Hs.83354 
J03836 Hs.82085 
NM_002205Hs.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE018302 Hs.2894 
NM_Q05424Hs78824 
BE242587 Hs.1 18551 
AA159248 Hs.180909 
NIVL006825HS.74368 
U84722 HS76206 
AI878826 Hs74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
A(039243 Hs^78585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.18858 
AW076098 Ks.345588 
Y12059 Hs.278575 
AA313825 Hs.21941 
AW388633 IHs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AKD01972 Hs,30822 
AB033888 Hs.86ig 
X64116 Hs.171844 
H93366 HS7567 



nktogen2 

Melanoma associated gene 
cystatinD 

v-ral simian leukemia viral oncogene hom 
inhibitor of DMA binding 1 , dominant neg 
serine (or cysteine) proteinase inhibito 
chemokine (C-X-C motif), receptor 4 (fus 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B. cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
caldtonin receptor-like 
gb:Human alpha satellite and satellite 3 
selectin E (endothelial adhesion molecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase inhibto 
intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2, group IVA (cytosollc. 
transcription fector 4 
connective tissue grouvth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosoph8a)-like (sea urchin fas 
EGF-conlalning flbulin-Uke extracelhila 
damage-specific DNA binding protein 2 (4 
multlmerin 

guanine nucleotide binding protein 11 

protein kinase Olike 1 

MAD (mothers against decapentaplegk; Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenln) 

lysyl oxidase-Iike 2 

serine (or cysteine) proteinase inhibito 
integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoietk:ally expressed homeobox 
peroxiredoxin 1 

transmembrane protein (63kD). endoplasnH 
cadherin 5. type 2. VEK:adherin (vascula 
caveolln 1. caveotae protein. 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein 

hypothetical protein DKF2p434G171 
6-cell CLUIymphoma 6. member B (zinc fi 
phospholipase A2. group IVC (cytosoBc» 
desmoplakin (DPI. DPil) 
bromodomain-containing 4 
AD036 protein 

solute carrier family 7, (cationk: amnio 
ESTs 

hypothetteal protein f=lJ 10849 

Homo sapiens HSPC285 mRNA. partial cds 

E3 ubiquitin l^ase SMURF2 

hypothetical protein FLJ11110 

SRY (sex detemilnlng regfon Y^box 18 

Homo sapiens cDNA: FU22296 fs. done H 

Homo sapiens cDNA: FU21962 fis. done H 
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AA425414 


Hs.33287 




AA398859 


Hs.18397 


106423 AA448238 


AB020722 


Hs.16714 


106793 AA478778 


H94997 


Hs.16450 


107174 M621714 


BE122762 


Hs.25338 


107216 D51069 


D51069 


Hs.211579 


107295 T34527 


AA166629 


Hs.80120 


107385 U97519 


NM_005397Hs.16426 


108756 AA127221 


AA127221 


Hs,117037 


108846 AA132983 


AL117452 


Hs.44155 


108888 


AA135606 


AA135606 


Ks.189384 


109001 


AA156125 


AI056548 


Hs.72116 


109166 AA179845 


AA219691 


HS73625 


109456 AA232645 


AW956580 Hs.42699 


109768 


F10399 


F06838 


Hs.14763 


110107 H16772 


AW151660 Hs.31444 


110906 


N39584 


AA035211 


Hs.17404 


110984 


N52006 


AW613287 Hs.80120 


111006 


N53375 


BE387014 


Hs.166146 


111018 


N54067 


A1287912 


Hs.3628 


111133 


N64436 


AW580939 Hs.97199 


111760 


R26892 


BE551929 


Hs.268754 


113073 


T33637 


N39342 


Hs.103042 


113195 T57112 


H83265 


Hs.8881 


113923 


W80763 


AW953484 Hs3849 


114521 


AA046808 


AW139036 Hs.108957 


115061 


AA253217 


AI751438 


Hs.41271 


115096 


AA255991 


A1683069 


Hs.175319 


115145 


AA258138 


AA740907 


Hs.88297 


115819 


AA426573 


AA486620 


Hs.41135 


115947 AA443793 


R47479 


Hs.94761 


116314 


AA490588 


AI799104 


Hs.178705 


116339 AA496257 


AK000290 


Hs.44033 


116430 AA609717 


AK001531 


Hs.66046 


116589 


D59570 


AI557212 


Hs.17132 


116733 


F13787 


AL157424 


Hs.6128g 


117023 


H88157 


AW070211 Hs.102415 


117186 


H98988 


H98988 


Hs.42612 


117563 


N34287 


AF055634 


Hs.44553 


117997 


N52090 


N52090 


Hs.47420 


118475 


N66845 


N66845 




118581 


N68905 


N68g05 




119073 


R32894 


BE245360 


Hs.279477 


119155 


R61715 


R61715 


Hs.310598 


119174 


R71234 


R71234 




119221 


R98105 


C14322 


Hs.250700 


119416 


T97186 


T97186 




119866 


W80814 


AA496205 


Hs.193700 


121335 


AA404418 


AA404418 




121381 


AA405747 


AW088642 Hs.97984 


123160 


AA488687 


AA488687 


Hs.284235 


123473 


AA599143 


AA599143 




123523 


AA608588 


AA608588 




123533 AA608751 


AA6087S1 




123964 


C13961 


C13961 




124006 


D60302 


AI147155 


Hs.270016 


124315 


H94892 


NM.005402Hs^757 


124659 


N93521 


AI680737 


Hs.289068 


124669 


N95477 


AI571594 


Hs.102943 


124847 


R60044 


W07701 


Hs.304177 


124875 


R70506 


AJ887664 


Hs.285814 


125091 


T91518 


T91518 




125103 


T95333 


AA570056 


Hs.122730 


125355 


R45630 


R60547 


Hs.170098 


125565 


R20839 


R20840 




125590 


.R23858 


R23858 


Hs.143375 


126511 


Ai024874 


T92143 


Hs.57958 


126563 


W26247 


AA516391 


Hs.181368 


126649 


AA856990 


AA001860 


Hs.279531 


126872 


AA136653 


AW450979 


127402 


AA358869 


AA358869 


Hs.227949 


127651 


AI123976 


AA382523 


Hs.105689 


127759 


AI369384 


AI369384 


Hs.292441 


128062 


AA379500 


AA379621 


Hs.105547 


128992 


R49693 


H04150 


Hs.107708 


129046 AA1^8 


AB029290 


Hs.108^ 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma ceil adhesion molecule 

UDP-N>acetyl-a:pha-D-galactosamine:polyp 

podocaiyxin-like 

ESTs 

DKFZP586Q1517 protein 

gb:zl10a05.s1 Soare^regnanLutems^NbH 

hypothetical protein FU20992 similar to 

RAB6 interacting. Idnesln-^ke (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-aoetyl-atpha-D-galactosamtne:polyp 
Homer, neuronal immediate early gene, 3 
mitogen>activated protein kinase Idnase 
complement component Clq receptor . 
Homo sapiens cONA FLJ1 1949 fis. done HE 
microtubule-associated protein IB 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FU22041 similar to 
408 ribosomal protein S27 isofonn 
Homo sapiens mRNA full length insert cDN 
ESTs 
ESTs 

endomucin-2 
K1AA1691 protein 

Homo sapiens cDNA FtJ1 1333 fis. done PL 

dipeptidyl peptidase 8 

hypothetical protein FU10669 

ESTs, Moderately ^ilar to 154374 gene 

synaptojantn 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs. WeaMy similar to ALU1_HUMAN ALU S 
unc5 (C.elegans homolog) c 
EST 

gb:za46c1 1 ^1 Soares fetal liver spleen 
gb:za69b09.s1 Soares JetaUung_NbHL19W 
ESTs 

ESTs. iVloderately similar to ALUI.HUMAN A 
gb7i54c08.s1 Soares placenta Nb2HP Homo 
tryptase beta 1 

9b7e50h09.s1 Soares fetai liver spleen 
Homo sapiens mRNA; cDNA DKF2p586l0324 (f 
gb:zw37e02.s1 SoaresJotaLfelus_Nb2HF8. 
hypothetical protein FU22252 similar to 
ESTs, Weakly similar to 138022 hypotheti 
gb:ae52d04.s1 Stratagene lung cardnonta 
gb:ae54e06.s1 Stratagene lung cardnoma 
gb:ae56h07^1 Stratagene lung carcinoma 
gb:C13981 Clontech human aorta polyA+mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FU1 1918 fis. done HE 

hypothetical protein MGC12916 

Homo sapiens done FLB8503 PR02286 mRNA. 

sprouty (Orosophila) homofog 4 

gb7e20fD5.s1 Stratagene lung (937210) H 

ESTs, Moderately similar to K1AA1 215 pro 

KIAA0372 gene product 

gb7gD5o08.r1 Soares intent brain 1NIB H 

Homo sapiens, done IMAGE:3840937, mRNA. 

EGF-TM74atrophilHKelated protain 

U5 snRNP-spedfk: protein (220 kD). orth 

ESTs 

gb:UI-H.BI3^!a^-12-04jl5l NCLCGAP.Su 
SEC13 (S. cerevisiae)-like 1 
MSTP031 protein 
ESTs 

neural pro^ratbn. differenfiafion an 
ESTs 

adin blndmg protein; macrophin (nticrof 
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129168 

129314 

129371 

129468 

129765 

129805 

129884 

130495 

130639 

130657 

130828 

130972 

131080 

131137 

131182 

131466 

131573 

131647 

131756 

131859 

131881 

132050 

132083 

132164 

132358 

132413 

132456 

132490 

132676 

132687 

132718 

132736 

132760 

132933 

132968 

132994 

133061 

133147 

133161 

133200 

133260 

133363 

133491 

133517 

133550 

133607 

133614 

133627 

133691 

133696 

133913 

133975 

133985 

134039 

134088 

134161 

134299 

134416 

134453 

134656 

134989 

135051 

135073 

135349 

100114 

100130 

100143 

100168 

100208 

100224 

100405 

100420 

100455 

100529 

100618 

100619 



M30257 
AA028131 
M10321 
J03040 
M86933 
AA012933 
AA286710 
AA243278 
059711 
T94452 
AA053400 
AA370302 
J05008 
U85193 
AA256153 
X83107 
AA046593 
AA410480 
D45304 
M90657 
AA010163 
AA136353 
Y07867 
U84573 
X60486 
AA132969 
AA1 14250 
F137a2 
AA283035 
AB002301 
AA056731 
U66019 
H99198 
AA598702 
N77151 
AA505133 
AB000584 
D12763 
AA253193 
AA432248 
AA083572 
AA479713 
L40395 
X52947 
W80846 
M34539 
D67Q29 * 
U09567 
M85289 
D10522 
W84712 
D29992 
L34657 
S78569 
D43636 
U97188 
AA487558 
M28882 
X70583 
X14787 
AA236324 
C15324 
AA452000 
D83174 
D00596 
011428 
013640 
014874 
026129 
028476 
086425 
086983 
087953 

HG1862-HT1897 
HG261441T2710 
H6263944T2735 



NIVL001078HS.109225 
BE622768 Hs.290356 
X06828 Hs.1 10802 
AW410538 Hs.1 11779 
M85933 Hs.1238 
AA012848 Hs.1 2570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs^l739 
NIVL001955HS.2271 
W27392 Hs.33287 
AI824144 Hs.23912 
F06972 Hs.27372 
AA040311 Hs^8959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NM_003542H5.46423 
AW361383 Hs^60116 
AB011084 Hs.48924 
NM.0012gOHs.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NIVL004600HS.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF234532 Hs.61638 
AA112748 Hs.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
A1866286 Hs71962 
BE619053 Hs.170001 
NM_000165Hs.74471 
A)129903 Hs.74669 
BE273749 

NWL003003HS.75232 
NIVL002047HS.75280 
M85289 HS211573 
AI878921 Ks.75607 
AU076984 Hs.7753 
C18356 Hs^95944 
L34657 Hs.78146 
NM.0022g0Hs.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
Ai272141 Hs.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W55956 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
NM.000304HS.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NM^002933Hs78224 
AL121516 Hs.138617 
AW291587 H&82733 
086983 Ks.118893 
AW888941 Hs.757a9 
BE313693 Hs^330 
A1752163 Hs.1 14599 
N24433 Hs.241567 



vascular cell adhesion moleaite 1 

mesoderm development candidate 1 

von Wiliebrand factor 

secreted protein, acidic, cysteine-rich 

amelogenln (Y chromosome) 

tubuiin-spedftc chaperone d 

lysosomal 

mitochondrial n'bosomal protein LI 2 
ESTs, Moderately sinular to (54374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586l1516 (f 
endotheEn 1 
nudear factor t/B 
ESTs 

BMX non-iBoeptor tyrosffie kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamiiy member 1 
upstream regulatory element binding prot 
ESTs 
Pirin 

procollagen-lysine, 2-oxoglutarate 5-dlo 
H4 histone temity. member G 
metalloprotease 1 (pitrilysin femily) 
KIAA0512 gene product; ALEX2 
LIM domain binding 2 
ESTs, Weakly ^milar to 138022 hypotheQ 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD. libon 
Homo sapiens cONA: FU23037 fis. ctone L 
thymosin, beta, Identified in neuroblast 
hypothetical protein MGC3178 
myosin X 

cloneHQ0310PRCX)310p1 
prostate differentiation factor 
interleukin 1 receptor-like 1 
hypothetical protein FIJ20373 
hypothetical protein FU10210 
Homo sapiens cDNA: FLI23197 fis, done R 
ESTs, Weakly similar to B36298 proline-r 
eukaryotic translation initiation fador 
gap junction protein, alpha 1 , 43kD (con 
veside-assoctated membrane protein 5 (m 
FK506-binding protein 1A (12kO) 
SEC14 (S. cerevisiaeHike 1 
glycyMRNA synthetase 
heparan sulfate proteoglycan 2 (perlecan 
myristoylated alanhe-rich protein kinas 
calunfienin 

tissue factor pathway Inhbitor 2 
ptatelet/endothe&al cell adhesion molec 
Iaminin.alpha4 
KIAA0096 protein 
IGF*I} mRNA-binding protein 3 
complement component Clq receptor 
melanoma cell adhesion molecule 
SRY (sex detenniriing region Y>box 4 
thron^ospondin 1 

nudix (nucleoside diphosphate linked moi 

SRY (sex detennining region Y)-box 4 

Homo sapiens mRNA; cDNA 0KF2p586E1624 (f 

serine (or cysteine) proteinase inhibito 

thymidylate synthetase 

peripheral myefin protein 22 

KIAA0015 gene product 

adrenomedullin 

ribonudease. RNase A family, 1 (pancrea 
thyroid honnone receptor interactor 12 
nidogen 2 

Metanona associated gene 

N*myc downstream regulated 

calmodulin 2 (phosphorylase kinase, delt 

collagen, type VIII. alpha 1 

RNA bindir^ motif, single stranded inter 
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100658 HG2855-HT2g9S U56725 Hs.180414 

100676 HG3044-HT3742 X02761 Hs.287820 

100718 HG3342-Hr3519 BE295d28 Hs.75424 
100752 HG3543W3739 T81309 

100828 HG406g4n'4339 ALj048753 Hs.303649 

100850 HG417-HT417 AA836472 Hs^97939 



100991 J03764 
101097 L06797 
101110 L08246 
101142 L12711 
101156 L13977 
101168 L15388 
101184 L19871 
101192 L20B59 
101317 L42176 
101336 L49169 
101345 L76380 
101400 M15990 
101475 M23254 
101485 M24736 
101496 M26576 
101505 M27396 
101543 M31166 
101557 M31994 
101560 M32334 
101587 M35878 
101592 M36429 

101633 M57730 

101634 M57731 
101667 M60858 
101682 M62994 
101714 M68874 
101720 M69043 
101741 M74719 
101744 M75126 
101793 M84349 

101837 M92843 

101838 M92934 
101840 M93056 
101857 M94856 
101864 M95787 
101931 S76965 
101966 S81914 

102012 U03057 

102013 U03100 
102024 U03877 
102059 U08021 
102121 U14391 
102283 U31384 
102300 U32944 
102378 U40369 
102395 U41767 
102460 U48959 
102491 U51010 
102499 U51478 
102523 U53445 
102560 U59289 
102564 U59423 
102589 U62015 
102600 U63825 
102645 U67963 
102687 U73379 
102693 U73824 
102709 U77604 
102759 U81607 
102804 U89942 
102882 X04412 
102907 X06985 
102915 X07820 
102927 X12876 
102960 X15729 
103011 X52541 
103020 X5a416 
103029 X54489 
103036 X54925 
103056 X57206 



J03836 Hs.82085 
BE245301 Hs.89414 
A1439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs75693 
NM_005308Hs^11569 
NI\4>a01674Hs.460 
BE247295 Hs.78452 
L42176 Hs.8302 
N(VL006732Hs.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs76288 
AA296520 Hs.89546 
X12784 Hs.119129 
AA307680 Hs75692 
M31166 Hs.2050 
BE293116 HS76392 
AW958272 Hs.347326 
AI762416 Hs.77326 
AF064853 Hs.91299 
NM_Q04428Hs.1624 
AV6S0262 HS75765 
Nl^005381 
AF043045 Hs.81008 
M6e874 Hs.211587 
M69043 Hs.81328 
NM_003199Hs.326198 
AI879352 Hs.1 18625 



W01076 
M92843 



Hs.278573 
Hs.343586 



BE243845 Hs75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs75777 
NM_a06823Hs75209 
X96438 HS76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 HS76224 
AI752666 Hs76669 
NM.004998HS.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs76941 



U53445 
R97457 
U59423 



Hs.15432 
Hs.63984 
HS79067 



AU076728 Hs.8887 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
NIVL007019HS.93002 
AA532780 H3.183684 
AA122237 Hs.81874 
NIVL005100Hs788 
NWL002318HS.83354 
AI767736 Hs.290070 



BE409861 
X07820 



Hs.202833 
Hs.2258 



BE512730 Hs.65114 
Aig04738 H5.76053 



AJ2434^ 
X53416 



Hs.326035 
Hs.195464 



heat shock 70kD protein 2 
flbronectin 1 

inhibitor of DMA blndir^ 1, dominant neg 
insuIMe growth factor 2 (somatonnedi 
small inducible (^kine A2 (monocyte ch 
cathepsin B 

serine (or cysteine) proteinase Inhibito 

cheniokine (C-X-C motif), receptor 4 (fus 

myeiokl cell teukernia sequence 1 (BCL2Hr 

transketolase (Wemk:ke-Kor5akofr syndro 

prolylcafboxypeptidase (angiotensinase C 

G protein-coupled receptor kinase 5 

activating transcriptfon factor 3 

solute carrier family 20 (phosphate tran 

four and a half LIM domains 2 

FBJ murine osteosarcoma viral oncogene h 

cateitonin receptor-Tike 

v-yes-1 YamaguchI sarcoma viral oncogene 

calpain 2, (m/ll) targe subunit 

seiectin E (endothelial adhesion moiecul 

collagen, type IV. alpha 1 

asparagine synthetase 

pentaxin-retated gene, rapidly induced b 

aldehyde dehydix)genase 1 family, member 

Intercellular adhesion molecule 2 

insufin-iike growth factor binding prote 

guanine nucleotide binding protein (G pr 

ephrtn-AI 

GR02 oncogene 

nudeoHn 

filamin B, beta (actin^ding pioteln-2 
phosphoRpase A2. group I VA (cytosoUc, 
nudear factor of kappa light polypeptM 
transcription factor 4 
hexokinase 1 

CD59 an^en p18'20 (antigen klenfified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase inhibito 
fatty acid binding protein 5 (psoriasis- 



AW800726 Hs789 
M13509 Hs^1G9 
Y18(^4 HS78877 



protein kinase (cAMP-dependent, catatyti 
immediate early response 3 
singed (Drosophila)-fike (sea urchin fas 
catenin (cadherin-assodaled protein), a 
EGF-containing fibulin-like extracelluia 
nicotinamide N-methyltransferase 
myosin IE 

guanine nudeotide binding protein 11 
dynein, cytoplasmk:. light pdlypeptkie 
spermkllne/spennine Nl^cetyltransfeiase 
a disintegrin and metalbprotelnase doma 
myosin, light polypeptkie kinase 
gbiHuman nkxytinamide N-methyltransferas 
ATPase, Na-»yK+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-<:adhenn (heart) 
MAO (mothers against decapentapiegk; Dr 
cysteine-rtdi, angkigeiuc inducer. 61 
hepatitis delta antigen-fnterading prot 
lysosomal 

ubiquitin canier protein E2-C 
eukaryotic translation citation fador 
microsomal glutathione S-trans(erase 2 
A kinase (PRKA) anchor protein (gravin) 
iysy) oxidasemke 2 
gelsolin (amytoWosis, Finnish type) 
heme oxygenase (decyding) 1 
matrix metaltoproteinase 10 (stromelysin 
keratin 18 

DEAD/H (Asp^lu-Ala^p/His) box poiypep 
early growth response 1 
fdamin A, alpha (adin-bindir^ protein* 
GR01 oncogene (melanoma growth stimulati 
matrix metaltoproteinase 1 (interstifial 
inositol 1,4Xrisphosphate 3-kinase 6 
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103080 X59798 
103095 X60957 
103138 X65965 
103176 X69111 
5 103195 X70940 
103347 X87838 
103371 X91247 
103432 X97748 
103471 Y00815 

10 103967 AA303711 
104447 L44538 
104764 AA025351 
104783 AA027050 
104798 AA029462 

IS 104865 AA045136 
104877 AA047437 
104894 AA054087 
104952 AA071089 
105113 AA156450 

20 105178 AA187490 
105196 AA195031 
105215 AA205724 
105263 AA227926 
105271 AA227986 

25 105330 AA234743 
105461 AA253216 

105492 AA256210 

105493 AA256268 
105594 AA279397 

30 105727 AA292379 
105732 AA292717 
105767 AA346551 
105882 AA400292 
105936 AA404338 
35 106031 AA412284 
106124 AA423987 
106222 AA428594 
106241 AA430108 
106263 AA431462 
40 106264 AA431470 
106366 AA443756 
106454 AA44947g 
106634 AA459916 
106724 AA465226 
45 106793 AA478778 
105799 AA479037 
106842 AA482597 
106868 AA487561 
106890 AA489245 
50 106961 AA504110 
106974 AA520989 
107030 AA599434 
107061 AA608649 
107086 AA609519 
55 107216 D51069 
107385 U97519 
107444 W28391 
107985 AA03563B 
108507 AA083514 
60 108695 AA121315 
108931 AA147186 
109001 AA156125 
109195 AA188932 
109390 AA219653 
65 109456 AA232645 
109737 F10078 
110411 H46032 
110660 H82117 
110906 N395&4 
70 111018 N54067 
111091 N59858 
111356 N90933 
111378 N93764 
111741 R26124 
75 111769 R27957 
112318 R55470 



AU077231 Hs.82932 
NWL005424HS.78824 
X65965 

AL021154 Hs.76884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 HS778585 
AA53351d H5.936S9 
AW952619 Hs.17235 
T79340 Hs.22575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.253» 
AW338625 Hs.22120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 Hs.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
AI678765 Hs.21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 Hs^1321 
BE019681 Hs.6019 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs.336429 
NM.014038Hs.5216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 Hs.26054 
BE185536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
AI817130 Hs.9195 
AL1 17424 Hs.25035 
BE147611 Hs.6354 
NM_012331Hs.26458 
D51069 Hs.211579 
NWL005397HS.16426 
W28391 Hs.343258 
T40064 HS71988 
AI554545 H5.68301 
AB029000 Hs.70823 
AA147188 

A1056548 Hs.72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs^8043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 
BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs24024 
AW629414 Hs^4230 
AW083384 Hs.11067 



cydin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunoglobulin and 

gb:H.sapiens SOD-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

euloryotic translation elongation factor 

catenin (cadherin-assodated protein), b 

thioiedoxin reductase 1 

gb:H.sapiens PTX3 gene promoter region. 

protein ^rosine phosphatase, receptor t 

ephiin-BI 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens done TCCCIA00176 mRNA sequ 

B^ CtX/^mphoma 6, member B (zinc fi 

Homo saptens done IMAGE:451939. mRr^A se 

phosphoRpase A2, group IVC (cytosolic, 

desmoptaldn (DPI, DPil) 

Homo sapiens, done IMAGE:3506202, mRNA. 

AD036 protein 

angiotensin receptor-tiite 1 

hypothetical protein FU 14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothetical protein FU20505 

CG1^3 protein 

RNA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (Drosophita) homolog 2 (mitogen 
ESTs 

Homo sapiens d)NA: FU22296 fis. done H 

Homo sapiens cDNA: FU21962 fis. done H 

Homo sapiens done FLB9213 PR02474 mRNA. 

Homo sapiens cDNA: FU21288 fis. done C 

hypothetical protein FU14005 

protein kbiase (cAMP-dependent. catatyti 

RIKEN cDNA 9130422N19 gene 

HSPC028 protein 

hypothetical protein FLJ22471 

Homo sapiens cDNA: FU22141 fis, done H 

ESTs 

Homo sapiens done 25012 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing anicyrin repeats tndu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FU10768 fis, done NT 
Homo sapiens cDNA FIJI 3698 fis. done PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma cell adhesion molecute 
podocalyxin-Dke 

proBferation-assodated 2G4, 38kD 

Homo sapiens mRNA; dDNA DKF2p564F053 (fr 

ESTs 

KIAA1077 protein 

gb2o38d01^1 Stratagene endothelial eel 
hypothetical protein FU20992 similar to 
solute carrier family 4, sodium bicarbon 
EH-domain containing 3 
ESTs 

ESTs. Moderately similar to A47582 B-cel 
Homo sapiens mRNA for KIAA1741 protein. 
ESTs 
ESTs 

mitogen^ctivated protein kinase kinase 
hypothetical protein DKFZp434N185 
mannosyl (alpha-13-)-g)ycoprotein beta- 
hypothetical gene DKFZp434A1114 
K1AA0846 protein 
ESTs 

ESTs. H^hly similar to T4639S hypotheti 
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112951 


T16550 


AA307634 


Hs.6650 


113057 T26674 


AW194301 Hs.339283 


113195 T57112 


H83265 


Hs.8881 


113490 T88700 


BE178110 


Hs.173374 


113542 T90527 


H43374 


Hs.7890 


113803 W42789 


AW880709 Hs^683 


113847 W60002 


NM.005032Hs.4114 


113910 W78175 


AA1 13262 


Hs.17901 


113947 


W84768 


W84768 




114047 W94427 


AL035858 


Hs.3807 


115061 


AA253217 


AI751438 


Hs.41271 


115819 AA426573 


AA486620 


Hs.41135 


115870 


AA432374 


NWL005985HS.48029 


115984 


AA446622 


AA987568 


Hs.74313 


116228 AA478771 


AI767947 


Hs.50841 


116264 


AA482594 


D51174 


Hs,272239 


116314 


AA490588 


AI799104 


Hs.178705 


116589 


D59570 


AI557212 


Hs.17132 


117023 


H88157 


AW070211 Hs.102415 


117112 


H94648 


AW969999 Hs.2g3658 


117156 


H97538 


W73853 




117176 


H98670 


H45100 


Hs.49753 


117280 


N22107 


M18217 


Hs.172129 


119559 


W38197 


W38197 




119866 


W80814 


AA496205 


Hs.193700 


120655 AA287347 


AA305599 


Hs.238205 


121314 


AA4Q2799 


W07343 


Hs.182538 


121335 AA404418 


AA404418 




121822 AA425107 


AI743860 




121835 AA425435 


AB033030 


Hs.300670 


122331 


AA442872 


AL133437 


Hs.1 10771 


122577 


AA452860 


AA82g725 


Hs.334437 


123160 


AA488687 


AA488687 


Hs.284235 


123486 


AA599674 


BE019072 


Hs.334802 


124059 


F13673 


BE387335 


Hs.283713 


124339 


H99093 


H99093 


Hs.343411 


124358 


N22495 


AW070211 Hs.102415 


124364 


N23031 


AF265555 


Hs.250646 


124726 


R15740 


NIVL003654HS.104576 


124763 


R39610 


BE410405 


HSJ6288 


125167 


W45560 


AL137540 


Hs.102541 


125304 


Z39833 


AL359573 


Hs.124940 


125307 


Z40583 


AW580945 Hs.330466 


125329 


AA825437 


AA825437 


Hs.58875 


125598 


R66613 


T40064 


Hs.71968 


125609 


AA868063 


AA868063 


Hs.104576 


418245 


AA128075 


AA088767 


Hs.83883 


127435 


N66570 


X69086 


Hs.286161 


127566 A1051390 


AI051390 


Hs.1 16731 


127619 AA627122 


AA627122 


Hs.163787 


128453 X02761 


X02761 


H5.287820 


128495 


AF010193 


NM.005904HS.100602 


128515 


AA149044 


BE395085 


Hs.10086 


128580 


U82108 


U82108 


Hs.101813 


128623 


D78676 


BE076608 


Hs.105509 


128642 


L35240 


Z28913 


Hs.102948 


128669 


AA598737 


W28493 


Hs.180414 


128903 


R69417 


AW150717 Hs.345728 


128914 


AA232837 


AW867491 Hs.107125 


129087 


N72695 


A1348027 


Hs.108557 


129188 


M30257 


NM_001078Hs.109225 


129226 


M96843 


BE222494 


Hs.180919 


129265 


X6a277 


AA530892 


Hs.171695 


129345 


AA292440 


R22497 


Hs.1 10571 


129468 


J03040 


AW410538 


Hs.1 11779 


129488 


AA228107 


AW966728 


Hs.54642 


129498 


AA449789 


AA449789 


Hs.75511 


129557 


W01367 


AL045404 


Hs.46366 


129619 


AA610116 


AA209S34 


Hs.284243 


129627 


AA258308 


T40064 


Hs.71968 


129762 AA460273 


AA453694 


Hs.12372 


129884 


AA286710 


AF055581 


Hs.13131 


130018 T68873 


AA353093 




130147 


D63476 


D63476 


Hs.172813 


130178 


M62403 


U20982 


HS.1S16 


130282 X55740 


BE245380 


Hs.153952 



vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-187J1 1 
ESTs, Weakiy similar to S41044 chromosom 
Homo sapiens cDNA RJ10500 fis, done NT 
Homo sapiens mRNA for KiAA1671 piotein. 
chromosome 8 open reading frame 4 
plastin3(risofbrm) 

Homo sapiens, done IMA6E:3937015. mRNA. 
gb:zh53cI03^1 SoaresJetaUver.spleen. 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA fiiD length insert cDN 
endomuctn*2 

snail 1 (drosophiia homolog). zinc finge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FLJ 11333 fis. cbne PL 
ESTs. IWoderately similar to 154374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, done C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp586i0324 (f 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb2w37e02.s1 Soares_totaLfetus_Nb2HF8. 
metallothionein IE (functionai) 
K1AA1204 protein 

Homo sapiens cDNA: FLJ21904 fis. done H 
hypothetical protein l\4GC4248 
ESTs. WeaMy similar to 138022 hypothec 
Homo sapiens cDNA FU14680 fis. done NT 
ESTs. Weakly similar to S64054 hypotheti 
DEAD/H (Asp-Glu-Ala-Asp/His) box pofypep 
Homo sapiens mRNA; cDNA DKFZf^N0121 (f 
baculoviral lAP repeat-containing 6 
carbohydrate (keratan sulfate Gal^) stil 
calpain 2. (m/ll) large subunit 
netrin 4 

GTP-4)inding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
carbohydrate (keratan sulfate Gal-6) sul 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FLJ13613 fts. done PL 
ESTs 
ESTs 

fibionedin 1 

MAD (mothers against decapentapiegic Dr 
type I transmembrane protein Fn14 
solute canter family 9 (sodiunVhydrogen 
CTt2gene 

en^ma (LIM domain protein) 
heat shod( 70kD protein 8 
STAT induced STAT inhibitor 3 
piasmaiemma veside associated protein 
hypothetical protein PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2. dominant neg 
dual spedfidty phosphatase 1 
growth amest and DNA-damage-indudble, 
secreted protein, addic, cysteine-rnh 
methbnine adenosyttransferase 11. beta 
connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

tripartite motif protein TRiM2 

lysosomal 

metaBothkinelnIL 

PAKHnteracGng exchange factor beta 
insufin-Gke growth factor-binding prote 
5'nudeofidase(CD73) 
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130431 L10264 
130495 AA243276 
130553 AA430a32 
130638 H16402 
5 130639 D59711 
130557 T94452 
130666 AA431571 
130776 R79356 
130818 AA280375 
10 130840 249269 
130899 241740 
131002 AA121543 
131080 J05008 
131084 AA101878 
15 131091 T35341 
131107 N87590 
131182 AA256153 
131207 W74533 
131319 U25997 
20 131328 V01512 
131328 V01512 
131328 V01512 
131328 V01512 
131509 X566B1 
25 131555 AA161292 
131564 AA491465 
131573 AA046593 
131692 D50914 
131756 D45304 
30 131859 M90657 
131909 W69127 
131915 AA316186 
132046 AA384503 
132050 AA136353 
35 132151 AA044755 
132164 U84573 
132187 AA058911 
132303 AA620962 
132314 AA285290 
40 132358 X60486 
132398 R31641 
132421 AA489190 
132490 F13782 
132520 AA257993 
45 132546 M24283 
132610 AA443114 
132716 T35289 
132840 N23817 
132883 AA047151 
50 132968 N77151 

132989 AA480074 
132999 Y00787 
133071 T99789 
133076 W84341 
55 133099 L09209 
133147 D12763 
133149 T16484 
133161 AA253193 
133200 AA432248 
60 133220 X82200 

133260 AA083572 
133295 L00352 
133349 N75791 
133391 X57579 
65 133398 X02612 
133436 H44631 
133454 AA090257 
133478 X83703 
133491 L40395 
70 133510 AA227913 
133517 X52947 
133526 M11313 
133538 L14837 
133582 M80721 
75 133584 D90209 
133590 T67986 



AW505214 Hs.155560 
AW250380 Hs.109059 
AF062649 Hs.252587 
AW021276 Hs.17121 
AI557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs.337986 
AF167706 Hs.19280 
AW190920 Hs.19928 
BE048821 Hs.20144 
AI077288 Hs.296323 
AL050295 Hs^039 
NM-00l955Hs^71 
N^L017413Hs.303084 
AJ271216 Hs.22880 
BE620886 HsJ5354 
AI824144 Hs.23912 
AF104266 Hs.24212 
NIVL003155HS.25590 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251. Hs.25647 
X56681 Hs^780 
T47364 Hs.278613 
T93500 H5.28792 
AA040311 Hs.28959 
BE5596&1 Hs.30736 
AA443986 Hs.31595 
AW960564 

NM.016558HS.274411 
A1161383 Hs.34549 
AI359214 Hs.179260 
A)267615 Hs.38022 
BE379499 Hs.173705 
AI752235 Hs.41270 
AA235709 Hs.4193 
BE177330 Hs.325093 
AF1 12222 Hs.323806 
NM^003542Hs.46423 
AA876616 Hs.16979 
AW163483 Hs.48320 
NM.001290Hs.4980 
AA257992 Hs.50651 
M24283 Hs.168383 
AA160511 Hs.5326 
BE379595 Hs.283738 
BE218319 Hs.5807 
AA373314 Hs3897 
AF234532 Hs.61638 
AA480074 Hs.331328 
Y00787 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533 Hs.66 
AA370045 Hs.6607 
AW021103 Hs.6631 
AB037715 Hs.183639 
NM^006074Hs.318501 
AA403045 Hs.6906 
A1147861 Hs213289 
AW631255 Hs.8110 
AW103364 Hs727 
NWL000499HS72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Hs.170001 
AW8B0841 Hs.96908 
NIVL000165HS.74471 
AU077051 Hs.74561 
NM.003267HS.74614 
M60721 Hs.74870 
D90209 Hs.181243 
T70956 Hs.75108 



calnexin 

mitochondrial rfbosomal protein L12 
pituitaiy tumor^transforming 1 
ESTs 

ESTs, Moderately similar to 154374 gene 
ESTs 

Homo sapiens cONA FU 10934 fis, done OV 

cysteine-rich niotor neuron 1 

hypo(helical protein SP329 

small inducible cytokine subliamlly A (Cy 

serum/glucocorticoid regulated kinase 

KIAA0758 protein 

endotheEn 1 

apelln: peptide Dgand for APJ receptor 

dipeptidylpeptidase 111 

GCN1 (general control of amlno^dd synt 

ESTs 

latrophilin 

stanniocaldn 1 

v-fos F6J murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
]un D proto-oncogene 
{nterfeion. alpha-indudble protein 27 
Homo sapiens cDNA FIJ1 1041 fis, ckxie PL 
ESTs 

K1AA0124 protein 
ESTs 

transmembrane 4 superiamily member 1 

SCAN domain-oontaining 1 

ESTs, Highly sImDar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FLJ22050 fis, done H 

procollagen-lysine, 2-oxoglu1arate 5-dio 

DKFZP58601624 protein 

Homo sapiens cONA: FLJ21210 fis. clone C 

pinin, desmosome associated protein 

H4 hlstone family, member G 

ESTs, Weakly similar to A43932 mudn 2 p 

double ring-ftnger protein, Dorfin 

LIM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

interceQuiar adhesion molecule 1 (CD54) 

amino add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPase Rab14 

Homo sapiens mRNA; cDNA DKFZp588P1622 (f 
myosinX 

hypothetical protein FU13213 
interteukin 8 

ESTs. Weakly similar to AF257182 1 G-pro 

Homo sapiens mRNA; cDNA DKFZp586J021 (fr 

amyloid beta (A4) precursor-like protein 

Interieuldn 1 receptor-like 1 

AXIN1 up-regulated 

hypothetical protein FU20373 

hypothetical protein FU10210 

Homo sapiens mRNA fuD length insert cDN 

Homo sapiens cDNA: FU23197 fis, done R 

k>w density lipoprotein receptor (famSi 

L-3-hydroxyacyl-Coenzyme A dehydrogenase 

inhibin. beta A (adivin A, activin A6 a 

cytochrome P450, subfamily I (aromatic c 

immediate earty protein 

hypothetical protein MGC5618 

cardiac ankyrin repeat protein 

eukaiyotic translation initiation fador 

p53-lnduced protein 

gap junction protein, alpha 1, 43kD (con 

alpha-2-inacroglobulin 

tight junctkm protein 1 (zona ocdudens 

H2.0 (DrosophilaHike homeo box 1 

activating transcription factor 4 (tax-r 

dusterin (oomplementtyste Inhibitor, S 
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133617 AA148318 BE244334 Hs.75249 

133651 U97105 AI301740 Hs.173381 

133671 T25747 AW503116 Hs.301819 

133678 K02574 AW247252 

5 133681 D78577 AI352558 

133722 X53331 AW969976 Hs.279009 

133730 S73591 BE242779 Hs.179526 

133750 X95735 BE410769 Hs.75873 

133802 L16862 AW239400 Hs.76297 

10 133825 U44975 BE616902 Hs.285313 

133838 M97796 BE222494 Hs.180919 

133859 U86782 U85782 Hs.178761 

133889 AA099391 U48959 Hs.211582 

133960 M19267 M19267 Hs.77899 

15 133975 D29992 C18356 Hs.295944 

133977 L19314 AI125639 Hs.250666 

134039 S78569 NM_002290Hs.78672 

134075 U28811 NM_012201Hs.78979 

134081 L77886 AL034349 Hs79005 

20 134164 C14407 AW245540 Hs79516 

134203 M60278 AA161219 Hs799 

134238 R81509 AA102179 Hs.160726 

134299 AA487558 AW580939 Hs.97199 

134332 D86962 D86962 Hs.81875 

25 134339 AA478i971 R70429 Hs.81988 

134343 D50683 D50683 Ks.82028 

134381 U56637 AI557280 Hs.184270 

134403 M61199 AA334551 

134416 M28882 X68264 Hs.211579 

30 134493 X15183 M30627 Hs.289088 

134558 S53911 NM_001773Hs.85289 

134817 U20734 AU076592 Hs.198951 

134983 D28235 D28235 Hs.196384 

134989 AA236324 AW968058 Hs.92381 

35 135052 AA148923 AL136653 Hs.93675 

135062 AA174183 AK000967 Hs.93872 

135069 AA456311 AA876372 Hs.93961 

135071 L08069 W27190 Hs.94 

135073 AA452000 W55956 Hs.94030 

40 135170 AA282140 T53169 Hs.9587 

135196 J02854 C03577 Hs.9615 

135348 AA442054 U80983 Hs268177 

134404 AB00045D AB000450 Hs.82771 
439561 AB002380 AF180681 Hs.6582 

45 10D082 AB003103 AA130080 Hs.4295 

132817 AB004884 N27852 Hs.57553 

130150 AF000573 BE094848 Hs.15113 

100104 AF008937 AR)08937 

447973 AF009301 AB011169 Hs.20141 

50 332613 AF009368 AF029674 Hs.173422 

100113 D00591 NM_001269Hs.84746 

133980 D00760 AA294921 Hs.348024 

100129 D11139 AA469369 Hs.6831 

100154 D14657 H60720 Hs.81892 

55 100169 D14878 AL037228 Hs.82043 

129718 D17716 NM_002410Hs.1 21502 

100190 D21090 M91401 Hs.178658 

134742 D26135 NM_001346Hs.89462 

100211 D26528 D26528 Hs.123058 

60 100238 D30742 L24959 Hs.348 

130283 D31762 NM 01 2288Hs.1 53954 

134237 D31765 D31765 Hs.170114 

100248 D31888 NM_015156HsJ8398 

100256 D38128 D25418 Hs.393 

65 100262 D38500 D38500 Hs^78468 

134329 D38551 N92036 Hs.81848 

100281 D42087 AF091035 Hs.184627 

100294 D49396 AA331881 Hs.75454 

100327 D55640 D55640 

70 100335 D63391 AW247529 Hs.6793 

134495 D63477 D63477 Hs34087 

100338 D63483 D86864 Hs57735 

135152 D64015 M96954 Hs.182741 

134269 D79990 NM_014737Hs.80905 

75 100372 D79997 NM_014791Hs,1 84339 

134304 D80010 BE613486 Hs.81412 



ADP-ribosylation factor-like 6 interacS 

dihydropynmidinase-like 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monoQxygenase/tiyptophan S-mo 

matrix 61a protein 

upregulated by 1.2&<iihydroxyv{tamin [>3 
zyxin 

G protein-coupled receptor kinase 6 

core promoter element blndii^ protein 

inhibitor of DNA binding 2, dominant neg 

26S proteasome-associated padi homolog 

myosin, light polypeptide kinase 

tropomyosin 1 (alpha) 

tissue factor pathway inhibitor 2 

hairy (Drosophila)-homok}g 

taminin, alpha 4 

Gotgi apparatus protein 1 

protein tyrosine phosphatase, receptor t 

brain abundant, menibrane attached signal 

diphtheria toxin receptor (heparin43indi 

Homo sapiens cDNA FLJ11680 fis. done HE 

complement component Clq receptor 

growth factor receptor-bound protein 10 

disabled (Drosophila) homolog 2 (mitogen 

transfonnjng growth factor, beta reoepto 

capping protein (actin filament) muscle 

sperm spedfic antigen 2 

melanoma cell adhesion molecule 

heat shock 90kD protein 1, alpha 

CD34 anQgen 

jun B proto-onoogene 

prostaglandin-endoperoxkie synthase 2 (p 

nudix (nucleoside diphosphate linked moi 

deddual protein induced by progesterone 

KIAA1682 protein 

Homo sapiens mRNA; cDNA DKFZp667D095<fr 
DnaJ (Hsp40) homolog, subfamily A, membe 
Homo sapiens mRNA; cDNA DKF2p586E1624 (f 
Homo sapiens cDNA: FLJ22290 fis, done H 
myosin regulatory light chain 2, smooth 
phosphoHpase C, gamma 1 (fomieriy subty 
vaccinia related kinase 2 
Rho guanine exchange factor (GEF) 12 
proteasome (prosome, macropain) 26S subu 
tousled-Oke kinase 2 

homogentisate 1,2-dioxygenase (homogenti 
^taxln16 

sintilarto S. cerevisiae SSM4 

KIAA1605 protein 

chromosome condensatton 1 

v-ral sin^n leukemia viral oncogene horn 

tissue inhibitor of metalbproteinase 1 

KIAA0101 gene product 

D1 23 gene product 

mannosyi (alpha-1,6-)-glycoprotein beta- 
RAD23 (S. cerevisiae) homolog B 
diacylgiyceFol kinase, gamma (90kD) 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
catdum/calmodulin-<lependent protein kin 
TRAM-like protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacydin) receptor 
postmelotic segregation increased 2-like 
RAD21 (S. pombe) homolog 
KIAA0118 protein 
peraxtredoxin 3 

gbiHuman monocyte PABl (pseudoautosomal 

platelet^divating fectorace^lhydrola 

KIAA0143 protein 

acetyl IDl rec^ton SREC 

HAI cytotoxx: granule-associated RNA-bi 

Ras assodatkHi (RalGDS/AF-6) domain fam 

KIAA0175 gene product 

liinni 
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100394 D84276 
100405 D86425 
100418 D86978 
133154 D87012 
5 134347 D87075 
444099 D87432 
100438 D87448 
134593 D87845 
100481 86109840^1098 
10 100552 HG2167-HT2237 
100591 HG2415-HT2511 
100852 HG28254rr2949 
100662 HG2887-HT3031 
100899 HG4660^5073 
15 100905 HG4704-HT5146 
100945 HG884-HT884 
100950 H6919-HT919 
100964 J00212 
135407 J04029 
20 130149 J04031 
131877 J04088 
101016 J04543 
134786 L06139 
134100 L07540 
25 134078 L08895 
101132 L11239 
134849 LI 1353 
332736 L13773 
101152 L13800 
30 135397 L14922 
432642 L15189 
101168 L15388 
421155 L16895 
101226 L27476 
35 415138 L27624 
134739 L32976 
130155 133404 
440538 L35263 
409916 L37347 
40 101294 L40371 
101300 L40391 
101310 L41607 
130344 L77566 
101381 M13928 
45 101381 Ml 3928 
415678 M14016 
133780 M14219 
101396 M15796 
101447 M21305 
50 101458 M22092 
101470 M22898 
134604 M22995 
101478 M23379 
133519 M24400 
55 131185 M25753 
134116 M27691 
133999 M28213 
130174 M29550 
129963 M29971 
60 132983 M30269 
133900 M31158 
101543 M31166 
101545 M31210 
101620 M55420 
65 134691 M59979 
133595 M62810 
101700 M64710 
101714 M68874 
134246 M74524 
70 101760 MB0254 
415022 M81780 
415022 M81780 
415022 M81780 
415022 M81780 
75 415022 M81780 
101791 M83822 



D84284 Hs.66052 
AW291587 Hs.82733 
D86978 Hs.84790 
D87012 Hs.194685 
AF164142 Hs.82042 
D87432 Hs.10315 
AA013051 Hs.91417 
NI\L000437Hs^34392 
X70377 Hs.121489 
AA019521 Hs^Ome 
NA4J004091HS.231444 
BE613608 Hs.142653 
AI368680 Hs.616 
AL039123 Hs.103042 
L12260 Hs.172816 
AF002225 Hs.180686 
AF128542 Hs.166846 
J00212 

J04029 Hs.99936 
AW067805 Hs. 172665 
J04088 Hs.1 56346 
J04543 HS78637 
T29618 Hs.89840 
AA460085 Hs.171075 
L08895 HS78995 
L11239 Hs.36993 
BE409525 Hs.902 
Z83689 Hs.l 14765 
AI984625 Hs.g884 
L14922 Hs.166563 
BE297635 Hs.3069 
NM«005308Hs.211569 
H87879 Hs.102267 
AF083892 Hs.75608 
C18356 Hs.295944 
NM.002419HS.89449 
AA101043 Hs.151254 
W76332 Hs.79107 
BE313625 Hs.57435 
AF168418 Hs.l 16784 
BE535511 
L41607 Hs.934 
AW250122 Hs.154879 
AW675039 Hs.1227 
AW675039 Hs.1227 
AW005903 Hs.78601 
AA557660 Hs.76152 
BE267g31 Hs78996 
M21305 
M22092 

NIVL000546HS.1846 
NM_002884Hs.865 
NM 002890HS758 
AW583062 Hs74502 
BE280074 Hs.23960 
R84694 Hs79194 
AA535244 Hs78305 
M29551 HS.151531 
M29971 Hs.1384 
M30269 

M31158 Hs.77439 
M31166 Hs.2050 
BE246154 Hs.154210 
S55271 Hs.247930 
AW382987 Hs.88474 
AA393273 Hs75133 
D90337 Hs.247916 
M68874 Hs.211587 
D28459 Hs.80612 
M80254 Hs.173125 
X59960 Hs,77813 
X59960 Hs.77813 
X59960 Hs.77813 
X59960 Hs.77813 
X59980 Hs.77813 
MB3822 Hs.62354 



CD38 antigen (p45) 

nidogen 2 

KIAA0225 protein 

topoisomerase (DMA) III beta 

solute carrier family 23 (nucleobase tra 

solute carrier family 7 (cationic amino 

topoisomerase (DMA) II binding protein 

platelet-activating factor aoetylhydrola 

cystatinD 

lysosomal 

Homo sapiens. SimSar to hypothetical pr 
ret finger protein 

SRY (sex determining region Yytoc 2 
microtubute-assodated protein IB 
neuregulin 1 

ubiquitln protein Ggase E3A (human papi 
polymerase (DMA directed), epsilon 
Empirically selected from AFFX single pr 
keratin 10 (epidermolytic hyperkeratosis 
methytenetetrahydmfolate dehydrogenase 
topoisomerase (DNA) II alpha (170kD) 
annexin A7 

TEK tyrosine kinase, endothelial (venous 
repOcation factor C (activator 1) 5 (36 
MADS box transcription enhancer factor 2 
gastrulation brain homeo box 1 
neurofibromin 2 (bBateral acoustic neur 
myeloid/lymphoid or mixed-lineage leukem 
spindle pole body protein 
replication factor C (activator 1) 1 (14 
heat shock 70kD protein 9B (mortalin-2) 
G protein-coupled receptor kinase 5 
lysyl Qxklase 

l^ht Junction protein 2 (zona oodudens 
tissue factor pathway inhibitor 2 
mitogen-activated protein kinase kinase 
kalGkrein 7 (chymcAiypfic. stratum com 
mitogen-activated protein kinase 14 
solute carrier family 1 1 (proton-coupled 
thyroid hormone receptor interactor 4 
transmembrane trafficking protein 
glucosaminyt (N-acetyl) transferase 2. 1 
DiGeorge syndrome critical regton gene D 
aminolevulinate. delta-, dehydratase 
aminolevulinate. delta-, dehydratase 
uroporphyrinogen decarboxylase 
decoHn 

proliferating cell nuclear antigen 
gbiHuman alpha satellite and satellite 3 
gb:Human neural cell adhesion molecule ( 
tumor protein p53 (Li-Fraumeni syndrome) 
RAP1 A, member of RAS oncogene family 
RAS p21 protein activator (GTPase adiva 
chymotrypsinogen B1 
cydin B1 

cAMP responsive element binding protein 
RAB2, member RAS oncogene family 
protein phosphatase 3 (fomnerty 2B), cat 
0-6-methylguanine-DNA methyltransferase 
nidogen (enactin) 

protein kinase, cAMP-dependent regulato 
pentaxln-related gene, rapidly Induoed'b 
endothelial differentiation, sphingolipi 
Epsik)n , IgE 

prostaglandin-endoperoxide synthase 1 (p 
transcriptton factor 6-like 1 (mitochond 
natriuretk: peptide precursor C 
phosphollpase A2, group IVA (cytosonc, 
ubkiuitin-Gonjugating enzyme E2A (RAD6 h 
peptidytprolyl isomerase F (cydophifin 
sphingomyelin phosphodiesterase 1, add 
sphingomyelin phosphodiesterase 1. add 
sphingomyelin phosphodiesterase 1. add 
sphlngomyefin phosphodiesterase 1. add 
sphingomyelin phosphodiesterase 1, add 
cell division cyde4-Qke 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



101812 


M86934 


BE439894 


Hs.78991 


101813 


M87338 

■ ■■vf www 


NM_002914Hs.139226 


133396 


M96326 


M96326 


Hs.72885 


428161 


M96954 


M96954 


Hs.182741 


129026 


IVI900WW 


AL120297 


Hs.108043 


101901 




H38026 


Hs.308 


134831 


S72370 


AA853479 


Hs.89890 


134039 


S78569 


NM-002290HS.78672 


442355 


070873 


AA456539 


Hs.8262 


101975 


SB339S 


AA079717 


Hs^83664 


101977 


S83364 


AF1 12213 


Hs.184062 


101978 




BE561610 


Hs.5809 


101998 


U01212 


U01212 


Hs.248153 


102003 


U01922 


U01922 


Hs.125565 


102007 


U02556 


U02556 


Hs.75307 


102009 


1102680 


BE245149 


Hs.82643 


416858 


U03272 


U03272 


Hs.79432 


132951 




AW821182 


Hs.61418 


135389 




U05237 


Hs.99872 


102048 


U07225 


U07225 


Hs.339 


130145 




U34820 


Hs,151051 


303153 




U09759 


Hs.246857 


420269 




U72937 


Hs.96264 


102095 


1 M1313 


U11313 


Hs.75760 


102123 


U14518 


NM 001809HS.1594 


102126 




AW950870 Hs.78961 


102133 


1115173 


AU076845 


Hs.155596 


102139 


1115039 


NM_004419Hs.2128 


102162 


U18291 


AA450274 


Hs.1592 




iHfiono 


NM^000107Hs.77602 




\J lOwOO 


AA159001 


Hs.180069 


131fi17 

Iw to 11 


1190536 


U20536 


Hs.3280 






AA232362 


Hs.157205 


105910 




BE619413 


Hs2437 


10921 A 


1 193759 


U23752 


Hs.32964 


lOZO 1 1 




U25435 


Hs.57419 


1'^1*?1Q 

lO lO 19 


I I95QQ7 


NIVL003155HS.25590 






U28251 


Hs.53237 


132316 




U28831 


Hs.44566 


102269 


1130045 


U30245 




417526 


U32315 


AA568906 


Hs.82240 


102293 


U3243Q 


AF090116 


Hs.79348 


102298 




AA382169 


Hs.54483 


102325 


LI35139 


At815867 


Hs.50130 


428734 


LI3R7fi4 


BE303044 


Hs.192023 


102361 




AA223616 


Hs.75859 


102357 


U3Q6S7 


U39656 


Hs.1 18825 


1 uxooo 


1141 '144 


AA362907 


HS76494 


109')Qd 


1141766 
\JH I/DO 


NM_003816Hs.2442 






AF010258 


Hs.127428 


109409 


1143966 


BE300330 


Hs.1 18725 






AW410035 Hs.75862 


102423 


U44754 


247542 


Hs.179312 




LJ47011 


AB014615 Hs^lO 


132828 


U47011 


AB014615 Hs.57710 


132828 


U4701 1 


AB014615 


Hs.57710 


132828 


U4701 1 


AB014615 


Hs.57710 


425322 


U47077 


U63630 


Hs.155637 


102450 


U48251 


U48251 


Hs.75871 






U50535 


Hs.1 10630 




1156633 


U96759 


Hs.198307 




U560Q1 


AB014595 


Hs.1 55976 




IJ5R637 


AA019401 


Hs.93909 




U5Q96Q 


R97457 


Hs.63984 




1J5P6R3 


U63830 


Hs.146847 


417173 


IIR7199 


U61397 


Hs.81424 


109fi^ 


1JR731Q 


U67319 


Hs.9216 


132736 


U68019 


AW081883 Hs.211578 


133070 


UoSOl 1 


U92649 


Hs.64311 


102663 


U70322 


NWL002270HS.16B075 


134660 


U73524 


U73524 


Hs.87465 


102735 


U79267 


AF111106 


H5.3382 


102741 


U79291 


AW959829 Hs.83572 


130564 


U82671 


U82671 


Ks.3e98D 


130564 


U82671 


U82671 


Hs^6980 


132164 


U84573 


AI752235 


Hs.41270 



DMA segment numerous copies, expressed 
replication factor C (activator 1) 2 (40 
azuroddin 1 (catloolc antimicrobiai pro 
TIA1 cytotoxic granule-associated RNA-bi 
Friend leutcemia virus integration 1 
arrestin 3. retinal (X-anestin) 
pyruvate carboxyfase 
laminin, alpha 4 

lysosomakissociated membrane protein 2 

aspartate beta-hydroxylase 

putative RabS-interacGng protein 

putative transmembrane protein; homolog 

olfactoiy marker protein 

translocase of inner mitodiondiial membr 

t-complex-associatBd-testis-expressed 1- 

protetn tyrosine kinase 9 

fibrilRn 2 (congenital contractural ara 

microfibrilIar*associated protein 1 

fetal Alziieimer antigen 

purinergic receptor P2Y, 6-protein coupl 

mitogen-activated protein kinase 10 

mitogen-activated protein kinase 9 

alpha thalassemia/mental retardation syn 

sterol carrier protein 2 

centromere protein A (17kD) 

protein phosphatase 1, regulatory (inhib 

BCL2/adenovirus E1B 19kD-jnteracting pro 

dual specificity phosphatase 5 

CDC16 (cell division cycle 16, S. cerevi 

damage-speciftc DMA binding protein 2 (4 

nudear respiratory factor 1 

caspase 6, apoptosis-related cysteine pr 

branched chain ammotransferase 1 , cytos 

eukaryofic translation initiafion factor 

SRY (sex detennining regkni Y)4iqx 1 1 

CCCTOblnding factor (zinc finger protsi 

stanniocaldn 1 

ESTs, Highly similar to Z169^HU!VIAN ZINC 
K)AA1641 protein 

gb:Human myetomonocytic spedfic protein 
syntaxin 3A 

regulator of G-protein signalling 7 
N-myc (and STAT) interactor 
necdin (mouse) homolog 
eukaryotlc translation initiation factor 
chromosome 11 open reading frame 4 
mitogen-activated protein kinase kinase 
proline arginine-rich end leudne-rich r 
a disinteghn and metalloproleinase doma 
homeoboxA9 

selenophosphate synthetase 2 
MAD (mothers against decapentaptegic, Dr 
sman nudear RNA activating complex, po 
f&roblast growth factor 8 (andiogen-ind 
fibroblast growth factor 8 (androgennnd 
fibroblast growth factor 8 (androgen-lnd 
fibroblast growth iatioT 8 (andiogen-ind 
protein kinase. DNA-activated. catalytic 
protein kinase C binding protein 1 
Human BRCA2 regbn. mRNA sequence CG006 
von Hippel-Lindau binding protein 1 
cullin 4B 

cyclic nudeotide gated channel beta 1 

cadherin 13. H-cadherin (heart) 

TRAF family member-assodated NFKB activ 

ubk^uitin-like 1 (sentrin) 

caspase 7. apoptosis-related cysteine pr 

Homo sapiens cDNA: FIJ23037 fe. done L 

a disintegrin and metalloproteinase doma 

karyopherin (importin) beta 2 

ATP/GTP-binding protein 

protein phosphatase 4. regulatory subunl 

hypothetical protein MGC14433 

melanoma antigen, family A. 2 

melarumta antigen, family A. 2 

prooo(fagen4ysine. 2-oxogbJtarato 5-dk> 
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102823 U90914 
102826 Ugi316 
102831 U91d32 
102846 096131 
5 129777 U97018 
134161 U97188 
134854 V00503 
429257 X04327 
413985 X06389 

10 419768 X07496 
102915 X07820 
134656 X14767 
413858 X15525 
102968 X16396 

IS 102971 X16609 
134037 X53586 
134037 X53586 
103023 X53793 
103037 X54936 

20 130282 X55740 
134542 X57025 
128568 X60673 
128568 X60673 
103093 X60708 

25 413076 X62048 
129063 X63097 
424460 X63563 
411077 X64037 
103181 X69536 

30 103184 X69878 
103194 X70649 
103208 X72841 
129698 X74987 
131486 X83107 

35 130729 X84194 
103334 X85753 
132645 X87870 
135094 X89066 

103352 X89398 
40 103352 X89398 

103353 X89399 
132173 X89426 
103371 X91247 
131584 X91548 

45 103376 X92098 
103378 X92110 
128510 X94703 
103410 X98506 
133490 X97230 

50 332689 X97230 
103438 X98263 
103440 X98296 
103452 X99584 
133536 Y00264 

55 420234 Y07566 
426502 Y07759 
134662 Y07827 

132083 Y07867 
103500 Y09443 

60 134389 Y09858 

132084 Y12394 
103540 Z11559 
133152 Z11695 
103548 Z15005 

65 103612 Z46261 

129092 AA011243 
103692 AA018418 
103695 AA018758 
129796 AA018804 

70 434993 AA031993 
132683 AA044217 
131887 AA046548 
103723 AA057447 
453368 AA058376 

75 133260 AA083572 
103765 AA085696 



085390 Hs.5057 
NIVL007274HS.8679 
AA262170 Hs.80917 
BE264974 Hs.6566 
U97018 Hs.12451 
AA634543 HsJ9440 
J03464 Hs.179573 
AW163799 Hs.198365 
AI018666 H5.75667 
T72104 Hs.93194 
X07820 Hs.2258 
AI750878 Hs.87409 
NM.001610HS.75569 
AU076611 Hs.154672 
X16609 Hs.183805 
A1808780 Ks.227730 
AI808780 Hs^27730 
AW500470 Hs.1 17950 
BE018302 Hs.2894 
BE245380 Hs.153952 
M14156 Hs.85112 
H12912 Hs.274691 
H12912 Hs^74691 
S79876 Hs.44926 
U10564 HS75188 
X63094 Hs.283822 
BE275979 Hs.296014 
AW977263 Hs.68257 
X69636 Hs.334731 
U43143 Hs.74049 
NM.004939HS.78580 
AW411340 Hs.31314 
BE242144 Hs.12013 
F06972 Hs.27372 
AI963747 Hs.18573 
NM_001260Hs.25283 
AI654712 Hs.54424 
NM_003304Hs.2506S7 
H09366 Hs.78853 
H09366 Hs.78853 
X89399 Hs.1 19274 
X89426 Hs.41716 
X91247 Hs.13046 
AA598509 Hs.29117 
AL036166 Hs.323378 
AL1 19690 Hs.153618 
X94703 

AA1 58294 Hs.295362 
AF022044 Hs.274601 
AF022044 Hs.274601 
AW175781 Hs.152720 
X98296 Hs.77578 
NM_006936Hs.85119 
W25797.comp 
AW404g08 Hs.96038 
Y07759 Hs.170157 
NM_007048Hs284283 
BE386490 Hs.279663 
AW408009 HS.2258D 
Y09858 Hs.82577 
NIVL002267HS.3886 
NWL002197Hs,154721 
Z11695 Hs.324473 
Z15005 Hs.75573 
BE336654 Hs.70937 
D56365 Hs.63525 
AW137912 Hs^583 
AW207152 Hs.186600 
BE218319 Hs.5807 
AA306325 Hs.4311 
BE264633 Hs.143638 
W17064 Hs.332848 
BE274312 Hs.214783 
W20296 Hs288178 
AA403045 Hs.6908 
AA085696 Hs.169600 



carboxypeptKiase D 

cytosoEc acyl coenzyme A thioester hydr 
adaptor-related protein oomptex 3, sigma 
thyroid hormone receptor (nteractor 13 
echinoderm mlootubule-associated protei 
IGF-II mRNA-binding protein 3 
collagen, type I. alpha 2 
2.3-bisphosphoglyceratB mutase 
synaptophysin 
apoIipoprotetnA-l 

matrix metalloproteinase 10 (stromelysin 

thrombospondin 1 

acid phosphatase 2, lysosomal 

methylene tetrahydrofblate dehydrogenase 

ankyifnI.eiythnKiytic 

integrin. alpha 6 

integrin. alpha 6 

multifunctional polypeptide similar to S 

placental growth factor, vascular endoth 

5' nucleotidase (CD73) 

tnsulin-like growth factor 1 (somatomedi 

adenylate kinase 3 

adenylate kinase 3 

dipeptidylpeptldase IV (CD26. adenosine 

weel (S. pombe) hontotog 

Rhesus blood group, D antigen 

polymerase (RNA) 11 (DNA directed) polyp 

general transcription factor IIP. polype 

Homo sapiens, clone IMAGE:3448306, mRNA, 

fms-related tyrosine kinase 4 

DEAD/H (Asp-GUi-Ala-Asp/His) box polypep 

retinoblastoma-binding protein 7 

ATP-binding cassette, sub-family E (OABP 

BMX non-ieceptor tyrosine kinase 

acylphosphatase 1. erythrocyte (common) 

cydin^ependent kinase 8 

hepatocyte nuclear fector 4, alpha 

transient receptor poienfial channel 1 

uractl-DNA glycosylase 

uracil-DNA glycosylase 

RAS p21 protein activator (GTPase activa 

endothelial cell-specific molecule 1 

thioredoxln reductase 1 

purine-fich element binding protein A 

coated vesicle membrane protein 

HCGVlll-1 protein 

RAB28, member RAS oncogene family 

DR1 -associated protein 1 (negative cofac 

killer cell immunogbbulln-like receptor 

killer cell immunoglobulin-like receptor 

M-phase phosphoprotein 6 

ubiquitin specific protease 9, X chromos 

SMT3 (suppressor of mif two 3. yeast) ho 

Hs.1 77486 amyloid beta (A4) precursor protein (pro 

Rte (Drosophila>4ike. expressed in many 

myosin VA (heavy polypeptide 12. myoxin) 

butyrophilin. subfamSy 3. member A1 

Pirin 

aflcylglycerone phosphate synthase 
spindEn-like 

karyopherin alpha 3 (importin alpha 4) 

aconitase 1, soluble 

mitogen-activated protein kinase 1 

centromere protein E (312kD) 

H3 hislone family, member A 

poly(rC)4)inding protein 2 

Homo sapi^ duomosome X map Xpl 1 23 L- 

ESTs 

GTPase Rab14 

SUMO-1 activating enzyme subunit 2 
WD repeat domain 4 
SWI/SNF related, matrix associated, acti 
Homo sapiens cDNA FLJ14041 lis. done HE 
Homo sapiens cDNA FU1 1968 fis. done HE 
Homo sapiens cDNA: FU23197 fis, done R 
KIAA0826 protein 
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103766 


AA088744 


At920783 


Hs.191435 


103767 


AA089688 


BE244667 




132051 


AA091284 


AA393968 


Hs.180145 


103773 


AA092700 


At219323 


Hs.101077 


135289 


AA092968 


AW372S69 Hs.9788 


409659 


AA094800 


AWg70843 Hs.55682 


103794 


AA100219 


AF244135 


Hs.30670 


131471 


AA1 14885 


AA164842 


Hs.192619 


134319 


AA1 29547 


BE304999 


Hs^85754 


103807 


AA133016 


AW95d264 Hs.103832 


446392 


AA1 49507 


AF142419 


Hs.15020 


129853 


AA161005 


BE379765 


Hs.129872 


103850 


AA1 87101 


AA1 87101 


Hs.213194 


103855 


AA1 95179 


W02363 




103861 


AA206236 


AA206236 


Hs,4944 


130634 


AA227621 


AI769067 


Hs.127824 


447735 


AA248283 


AA775268 


Hs.6127 


103909 


AA249611 


AA249611 


HsAJm 


458928 


AA282640 


AF043117 


Hs.24594 


415824 


AA287199 


D42039 


Hs.78871 


129013 


AA313990 


AA371156 


Hs,107942 


129435 


AA3 14256 


AF1 51852 


Hs.1 11449 


103988 


AA314389 


AA314389 


Hs.342849 


104000 


AA324364 


AI146527 


Hs.80475 


425284 


AA329211 


AF155568 


Hs.348043 


128629 


AA399187 


AL096748 


Hs.1 02708 


133281 


AA421079 


AK001601 


Hs.69594 


104104 


AA422029 


AA422029 


Hs.143640 


332455 


AA425230 


NM O05754HS.220689 


132091 


AA447052 


AW954243 




135073 


AA452000 


W55956 


Hs.94030 


131367 




AI750575 


Hs.173933 


129593 


AA4A7015 


AI338247 


Hs.98314 


133505 


n01527 


AI630124 


Hs.324504 


132064 


C01714 


AA1 21098 


Hs.3838 


442351 


\.,«U loll 


W52642 


Hs.8261 


131427 


C02352 


AF1 51879 


Hs.26706 


433892 


C02375 


A1929357 


Hs.323966 


104282 


C14448 


C14448 


Hs.332338 


134827 


D16611 


BE314037 


Hs.89866 


425330 


D25216 


D25216 


Hs.155650 


131742 


D31352 

L^W 1 WW£» 


AA961420 


Hs.31433 


456935 


D58024 


AA370362 


Hs.57958 


425218 


□80897 


NM 014909HS.155182 


104334 


D82614 


D82614 


Hs.78771 


134593 

1 Www WW 


D87845 


NM„000437Hs.234392 


134731 


089377 


D89377 


Hs.89404 


445776 




NM_001310Hs.13313 


131670 


H40732 


H03514 


Hs.15589 


104394 


H46617 


AA129551 


Hs.172129 


104402 


H56731 


H56731 


Hs.132956 


439130 


H75570 


AA306090 


Hs.124707 


129077 


1 1 f Www 


N74724 


Hs.108479 


104417 


H81241 


AI819448 


Hs.320861 


134927 


L36531 


L36531 


Hs.91296 


129280 


M63154 


M63154 


Hs.1 10014 


134498 


M63180 


AW246273 Hs.84131 


104460 


M91504 

l*lw 1 WV^ 


AW955705 Hs.62604 


104488 


N56191 

1 «ww 1 w 1 


N56191 


Hs.106511 


131248 


N78483 


AI038989 


Hs.332633 


130017 


R14652 


AK000096 


Hs.143198 


104530 


R?fl45Q 


Af^01676 


Hs.12457 


104534 


R22303 


R22303 




104544 


R33779 


AI091173 


Hs^362 


133328 

1 wW 


R36553 

f \wWwwW 


AW452738 Hs^65327 


104567 


R64534 


AA040620 


Hs,5672 


129575 


R70621 


F08282 


Hs.278428 


130776 


R79356 


AF167706 


Hs,19280 


104599 


Ra4933 


AW815036 Hs.151251 


104660 


AA007160 


BE2g8665 


Hs.14846 


104667 


AA007234 


AI239923 


Hs.63931 


104718 


AA018409 


A1143020 


HS.362S0 


104764 


AA025351 


AI039243 


Hs.278585 


104786 


AA027168 


AA027167 


Hs.10031 


104787 


AA027317 


AA027317 




134079 


AA029423 


AK001751 


Hs.171835 



ESTs 

CGl-100 protein 
HSPC030 protein 

ESTs, Weakly similar to T22363 hypotheti 

hypothetical protein MGC10924 sMar to 

eukaryotic translation initiation factor 

hepatocellu^ cardnoma-assodated anti 

KIAA1600 protein 

fumarate hydratase 

sMar to yeast Upf3. variant B 

homolog of mouse quaking QKl (KH domain 

sperm associated ant^en 9 

hypothetical protein (^GC10895 

hypothetical protein FU10330 

hypothetical protein FU 12783 

ESTs, Weakly similar to T28770 hypotheti 

Homo sapiens cDNA: FU23020 fis, done L 

SH3 domain binding gtufcamicadd-nch pr 

ubiqultination factor E4B (homologous to 

mesoderm development candidate 2 

DKF2P564M112 protein 

CGI-94 protein 

ADP-ribosylation fador-fike 5 

polymerase (RNA) li (DNA directed) polyp 

NS1 -associated protein 1 

DKFZP434A043 protein 

high-mobility group 20A 

ESTs, Weakly sfanilar to hyperpoiarizatio 

Ras-GTPase-acGvating protein SH3-domain 

K1AA0251 protein 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
nudear factor I/A 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
Homo sapiens mRNA; cDNA E}KFZp586J0720 (f 
serum-indudble kinase 
hypothetical protein FLJ22393 
CGl-121 protein 

Homo sapiens done H63 unknown mRNA 
EST 

coproporphyrinogen oxklase (coproporphyr 

KiAA0014 gene produd 

ESTs 

EGF-TM7-latnophirin-related protein 
K1AA1036 protein 
phosphoglycerate kinase 1 
platelet-activating fador acetylhydrola 
msh (Drosophila) homeo box homolog 2 
cAMP responsive element binding protein- 
ESTs 

Homo sapiens cDNA: FU21409 fis. done C 

ESTs 

ESTs 

ESTs 

KmppeHIke factor 8 
integrin. alpha 8 

gastric intrinsic tedor (vitamin B synt 

threonyl-tRNA synthe^e 

Homo sapiens, done IMAGE:4299322. mRNA, 

protocadherin 17 

Bardet-Bledl syndrome 2 

inhibitor of growth family, member 3 

hypothetical protein FU10814 

gb:yh26b09.r1 Scares placenta Nb2HP Homo 

ESTs. Weakly similar to p40 [H.sapiens] 

hypothetical protein DKFZp76 11141 

hypothetical protein AF140225 

progestin induced protein 

cysteine-fich motor neuron 1 

ESTs 

Homo sapiens mRNA; cDNA OKFZpS64D016 (fr 
ESTs 

CSTs. Weakly simBar to 138022 hypotheti 
fSTs 

K(AA(»55 protein 

gb:z697d1U1 SoaresLfetaLheaiLNbHHigW 
hypothetteal protein FLJ10869 
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104804 AA031357 
104865 AA045136 
130828 AA053400 
104907 AA055829 
5 104943 AA065217 
105013 AA116054 
105024 AA126311 
132592 AA129390 
105038 AA130273 
10 105077 AA142919 
105096 AA150205 
129215 AA176867 
105169 AA180321 
132796 AA180487 
15 427210 AA187634 
105200 AA195399 
130114 AA234717 
105330 AA234743 
105337 AA234957 
20 422040 AA235604 
105376 AA236559 
105397 AA242868 
431679 AA251776 
131991 AA251909 
25 421305 AA252672 
105489 AA256157 
105508 AA256680 
105539 AA258873 
135172 AA262727 
30 131569 AA281451 
431129 AA281545 
105643 AA282069 
105659 AA283044 
105666 AA283930 
35 105674 AA284755 
105709 AA291268 
105722 AA291927 
105765 AA343514 
115951 AA398109 
40 130884 AA398109 
105962 AA405737 
105985 AA406610 
106008 AA411465 
457322 AA416886 
45 134222 AA424013 
446954 AA424148 
106141 AA424558 
447973 AA424961 
105157 AA425357 
50 428314 AA425921 
446727 AA426220 
106196 AA427735 
457714 AA430673 
133200 AA432248 
55 106302 AA435896 
105328 AA436705 
450534 AA446561 
106423 AA448238 
439608 AA449756 
60 106477 AA450303 
106503 AA452411 
446999 AA454566 
106543 AA454667 
442007 AA456437 
65 106589 AA456646 
106593 AA456826 
106596 AA456981 
423064 AA458959 
106636 AA459950 
70 106654 AA460449 
131353 AA463910 
106707 AA464603 
452909 AA464606 
106717 AA465093 
75 453141 AA465692 
108747 AA476473 



AI858702 Hs.31803 
n9340 Hs^2575 
AW631469 Hs.203213 
AA055829 Hs.196701 
AF072873 Hs.114218 
H63789 Hs.296288 
AA125311 Hs.9879 
AW803564 Hs^88850 
AW503733 Hs.9414 
W55946 Hs.234863 
AL042506 Hs^1599 
AB040930 Hs.126085 
BE245294 Hs.180789 
NM.0062B3HS.173159 
BE396283 Hs.173987 
AA3281Q2 Hs.24641 
AA233393 Hs.14992 
AW338625 Hs^120 
AI468789 Hs.347187 
AA172106 Hs.l 10950 
AW994032 Hs.8768 
AA814807 HsJ395 



AK000046 
AF053306 
BE397354 
AA256157 



Hs.343877 
Hs.36708 
Hs.324830 
Hs.24115 



AA173942 Hs.326416 
AB040884 Hs.l 09694 



AB028956 

AL389951 

AL137751 

BE621719 

AA283044 

AA426234 

AI609530 

A1928962 

AI922821 

AA299688 

BE546245 

BE546245 



Hs.12144 

Hs.271623 

Hs.263671 

Hs.173802 

Hs.25625 

Hs.34908 

Hs.279789 

Hs.26761 

Hs.32433 

Hs.24183 

Hs.301048 

Hs.301048 



AW880358 Hs.339808 
AA406610 
AB033888 Hs.8619 
A1815486 Hs.243901 
AW855861 Hs.8025 
AB037850 Hs.16621 
AF031463 Hs.9302 
AB011169 Hs.20141 
W37943 Hs.34892 
AW1 35049 Hs^6285 
AB011095 Hs.16032 



AA525993 
AA083764 
AB037715 
AA398859 
AL079559 
AI570189 
AB020722 



Hs.173699 

Hs.183639 

Hs.18397 

Hs^8020 

Hs^5132 

Hs.16714 



AW864696 Hs^01732 
R23324 Hs.41693 
AB033042 Hs.29879 
AA151520 

AA676939 Hs.69285 
AA301116 Hs.142838 
AK000933 Hs^8661 
AW296451 Hs.24605 
AA452379 
AF265208 Hs.8740 
AW958037 Hs^86 
AW075485 Hs^86049 
AW754182 
AK000566 Hs.98135 
NM_015368Hs30985 
AA600357 Hs^9489 
AB01454d HS31921 
NM.007118HS.171957 



ESTs, Weakly simflar to N-WASP (H^plen 
B-cell CLL/tymphoma 6, member B (zinc fi 
ESTs 

ESTs, Weakly similar to ALUl.HUMAN ALU S 

frizzled (DrosophHa) homotog 6 

ESTs. Weakly simOar to KIAA063d protein 

ESTs 

Homo sapiens cDNA: FU22528 fis, done H 
KIAA1488 protein 

Homo sapiens cDNA FLJ12082 fis. done HE 
KnippeMike fader 7 (ubiquitous) 
KIAA1497 protein 
S164 protein 

transfbnning, addle coiledcoO contain 
eukaiyotic translation initiatjon fador 
cytoslceleton associated protein 2 
hypothettoa) protein FU11151 
ESTs 

myotubularin related protein 1 
Rag C protein 

hypothetical protein FU10849 
hypothetical protein FLJ23182 
hypothetical protein FU20039 
budding uninhibited by benzimidazoles 1 
diptheria toxin resistance protein requi 
Homo sapiens cDNA FU14178 fis, done NT 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1451 protein 
KtAA1033 protein 
nucleoporin 50kO 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

K1AA0603 gene produd 

hypothetical protein FU11323 

ESTs, Weakly similar to T17210 liypotheti 

hlstone deacetylase 3 

DKFZP586tJ)724 protein 

ESTs 

ESTs 

sec13-like protein 

sec13-like protein 

hypothetical protein FU10120 

gb:2vl5b10.s1 Soares_NhHIVlPu_S1 Homosapi 

SRY (sex detemiining region Y)-box 18 

Homo sapiens cONA FU20738 fis, done HE 

Homo sapiens done 23767 and 23782 mRNA 

DKFZP434I116 protein 

phosdudrhlike 

similar to S. cerevisiae SSM4 
KIAA1323 protein 

Homo sapiens cONA FU10643 fts. done NT 
KIAA0523 protein 

ESTs, Weakly sinular to ALU1.HU1VIAN ALU S 

hypothetical protein MGC3178 

hypothetical protein FU10210 

hypothetical protein FLJ23221 

KIAA0766 gene produd 

KIAA0470 gene produd 

Rho guanine exchange factor (GEF) 15 

hypothetical protein MGC5306 

DnaJ (Hsp40) homoiog, subiiamlly B, membe 

cofador required for Spl transciiptiona 

hypothetical protein MGC4485 

neuropHln 1 

nucleolar phosphoprotein Nopp34 

Homo sapiens cDNA FU10071 fis, done HE 

ESTs 

ESTs. Moderately similar to ALU7_HU1VIAN A 
SWl/SNF related, matrix associated, acti 
ribosomal protein L4 
phosphoserine aminotransferase 
gb:RC2-CT0321.1311994)11-d)1 CT0321 Homo 
hypothetical protein FU2D559 
pannexin 1 

TIA1 cytotoxic granule-associated RislA-bi 
K1AA0848 protein 

triple functional domain (PTPRF interad 
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inR77Q 
lUOf fO 






nS.looooo 


1 nf\7fii 

lUOf 01 


A AA7SIA7A 


AAojUjIU 


nS./4 lol 


106817 


AA480889 




nS. 10072 


106846 


AA485223 


Ad037744 


nS.34o92 


106848 


AA485254 


A A AAnf\AA 

AA449014 


HS.121025 


106856 


AA486183 


Waoooo 


n5.2o512o 


418699 


AA4g6936 


Dc5o8oo9 


HS.17oll30 


107001 


AA598589 


Al92o520 


HS.3101D 


442853 


AA598831 


A i AiAn4 *nc 


Hs.17121 


107054 


AA600150 


AI07o459 


ns.15978 


107059 AA608545 


DCD14410 


rlS.23D44 


107080 AA609210 


Al An*V\A9 

AL1 22043 


Hs.19221 


107115 


AA610108 


Bb379o23 


Hs.27d93 


107130 


AA620582 


AoUo31Ub 


ns.12913 


107156 


AA621239 


A A •! OTfX A *i 

AA1 37043 


nS.9QD3 


107174 


AA621714 


BE1 22762 


nS.25338 


130621 


AA621718 


AW5i30o7 


Hs.16803 


107190 


D19673 


AA836401 


Hs.87860 


132626 


D25755 


AW504732 


Hs.21275 


107217 


D51095 


AL080235 


Hs.35861 


332584 


D60272 


AA357879 


Hs.29423 


444655 


T08879 


AF088886 


Hs.11590 


107295 


T34527 


AA1 85629 


Hs.80120 


107299 


T40327 


Dt277457 


Hs.30661 


107315 


T62771 


AA3id241 


Hs.90691 


107316 


T63174 


T63174 


Hs. 193700 


107328 


T83444 


AVv959891 


Hs.76591 


107334 


T93641 


T93597 


MS. 187429 


456340 


U48263 


U48263 


Hs.89040 


128636 


U49065 


U49065 


Hs.1 02865 


129938 


U7g300 


AtAmr%400o 

AW003668 


Hs.135587 


107375 


U88573 


BE011845 


Hs^51064 


130074 


U93867 


AL038598 


Hs^0745 


107387 


W01094 


D86983 


Hs.118893 


132036 


W01568 


AL1 57433 


Hs.37706 


107426 


W26853 


W28853 


Hs^91003 


135388 


W27965 


W27965 


Hs.99865 


130419 


W36280 


AF037448 


Hs.155489 


107469 


W47063 


W47063 


Hs.94668 


434203 


W79060 


BE262677 


Hs.283558 


107506 


W88550 


AB028981 


Hs.8021 


132358 


X60486 


NM_003542Hs.46423 


107522 


X78931 


X78931 


Hs.99971 


456495 


Z14077 


NM^C03403Hs.97496 


107582 


AA002147 


AA002147 


Hs.59952 


107609 


M004711 


R75654 


Hs.164797 


107661 


AA010383 


AA010383 


Hs.60389 


107714 


AA016761 


AA015761 


Hs.60642 


107775 


AA018772 


AW008846 Hs.60857 


107832 


AA021473 


AA021473 




107859 


AA024835 


AW732573 Hs.47584 


107914 


AA027229 


AA027229 


Hs.61329 


107935 


AA029428 


AA029428 


Hs.61555 


410196 


AA035143 


AI936442 


Hs.59838 


131461 


AA035237 


AA992841 


Hs.27263 


108007 


AA039347 


AA039347 


Hs.61916 


108029 


AA040740 


AA040740 


Hs.62007 


108040 


AA041551 


AL121031 


Hs.159971 


108084 


AA045513 


AA058944 


Hs.l 16602 


108088 


AA045745 


AA045745 


Hs.62886 


108168 


AA055348 


AI453137 


Hs.63176 


130719 


AA056582 


AA679262 


Hs.14235 


108189 


AA056697 


AW376061 Hs.63335 


108190 


AA056746 


AA056746 


Hs.63338 


108203 


AA057678 


AW847814 Hs.289005 


108216 


AA058681 


AA524743 


Hs.44883 


108217 


AA058686 


AA058686 


Hs.62588 


108245 


AA062840 


BE410285 


Hs^9545 


108277 


AA064859 


AA064859 




108280 


AA065069 


AA065069 




108309 


M069923 


AA069818 




108340 


AA070815 


AA069820 


Hs.180909 


108403 


AA075374 


AA075374 




108427 


AA076382 


AA078382 




108435 


AAQ78787 


T82427 


Hs.194101 


108439 AA078986 


AA078986 





tSTs 
ESTs 
ESTs 

K1AA1323 protein 

chromosome 11 open reading frame 5 
Homo sapiens mRNA full length insert cDN 
ESTs. WealOy similar to ALU8.HUMAN ALU S 
putative DhtA binding protein 
ESTs 

KIAA1272 protein 

RAD51 (S. oerevlsiae) homolog (E cofi Re 
hypothetical protein DKFZp566G1424 
pepfidylprolyt isomerasa (cydophSlnH 
KiAA1280 protein 

programmed cell death €>interacting prot 
ESTs 

LUC7 (S. cenevisiaeHike 
ESTs 

hypothetical protein FLJ1101 1 
DKFZP586E1621 protein 
ESTs; Weakly similar to macrophage tecti 
cathepsin F 

UDP>N-aoetyl-atpha-D-gaiactosamine:polyp 

hypothetical protein MGC4605 

nudeophosmin/nucleoplasmin 3 

Homo sapiens mRNA; cDNA DKFZp566l0324 (f 

KIAA0887 protein 

ESTs 

prepronociceptin 
interleukin 1 receptor<like 2 
Human cione 23629 mRNA sequence 
high-mobility group (nonhistone chromoso 
polymerase (RNA) III (DNA directed) (62k 
Melanoma associated gene 
hypothetical protein DKFZp434E2220 
hypothetical protein MGC4707 
ep'mwrphin 

NS1 -associated protein 1 
ESTs 

hypothetical protein PR01855 

KIAA1058 protein 

H4 histone family, member Q 

zinc finger protein 272 

YY1 transcription factor 

EST 

hypothetical prote'si FiJ13693 

ESTs 

ESTs 

ESTs 

gb:ze66c1l.s1 Soares retina N2b4HR Homo 
potassium voltage-gated channel, deteyed 
ESTs. Weakly similar to T16370 hypotheti 
ESTs 

hypotheb'cal protein FLJ10808 

KtAA1458 protein 

EST 

ESTs 

SWI/SNF related, matrix associated, act! 
Homo sapiens, done IMA6E:4154008. mRNA, 
ESTs 
ESTs 

hypothetical protein FLJ20008; KIAA1839 
ESTs. Moderately simitar to A46010 X-fin 
EST 

Homo sapiens cDNA: FU21532 fis. done C 

ESTs 

ESTs 

proteasome (prosome, macropain) subuntt. 
gb2m50fD3.sl Stratagene fibroblast (937 
gb2m12e11.s1 Stratagene pancreas (93720 
gb:zm67e03.r1 Stratagene neuroepithelium 
peroxiredoxin 1 

gb:zm87a01^1 Stratagene ovarian <ancer 
gb:zm91g08^1 Stratagene ovarian cancer 
Homo sapiens cDNA: FLJ20869 fs, done A 
gb2ni92h01^1 Stratagene ovariantanoer 
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108465 


AA079393 




Hs.3462 


108469 


AA079487 


M079487 




108500 AA083207 


M083207 


Hs.68270 


108501 


AA083256 


M083256 




108533 


AA084415 


M084415 




108562 AA085274 


M1007g8 




108589 


AA088678 


AI732404 


Hs.68846 


130890 AA100925 


AI907537 


Hs.76698 


432645 AA101255 


D14041 


H5^7340 


130385 


AA126474 


AW067800 Ks.155223 


108749 AA127017 


M127017 


Hs.71052 


108807 


AA129968 


AI652236 


Hs.49376 


108808 


AA1 30240 


M045088 


Hs,62738 


108833 


AA131866 


AF1 88527 


Hs.61661 


108846 


AA1 32983 


AL1 17452 


Hs.44155 


108857 


AA1 33250 


AK001468 


Hs.62180 


131474 


AA1 33583 


L46353 


Hs^726 


108894 


AA1 35941 


AK001431 


Hs^lOS 


108941 


AA148650 


M148650 




108968 


AA151110 


A1304870 


Hs.188680 


108996 


Ml 55754 


AW995610 Hs.332436 


109001 


AA156125 


AI056548 


Hs.72116 


131183 


AA1 56289 


AI611807 


Hs.285107 


109019 AA156997 


M156755 


Hs.72150 


109022 


Ml 57291 


M1 57291 


Hs^1479 


109023 


M157293 


M1 57293 


Hs.72168 


109068 


M164293 


Ml 64293 


Hs.72545 


109072 


M164676 


Ar732585 


Hs,22394 


426981 


M167375 


AL044675 


Hs.1 73081 


130346 


M167550 


H05769 


H5.188757 


109146 AA176589 


M176589 


Hs.142078 


109172 AA180448 


M180448 


Hs.144300 


428438 


M187144 


NM-001955HS.2271 


129208 AA189170 


AI587376 


Hs.109441 


109222 


M192757 


M192833 


Hs.333512 


109300 


AA205650 


M418276 


Hs.170142 


109481 


AA233342 


M878923 


Hs.289069 


109485 


AA233472 


BE619092 


Hs.28465 


109516 


AA234110 


AI471639 


Hs.71913 


109537 


D80981 


AI858695 


Hs.34898 


109556 


F01660 


Af925294 


Hs.87385 


109577 


F02206 


F02206 


Hs.2g6639 


109578 


F02208 


F02208 


Hs.27214 


109595 


F02544 


M078629 


Hs,27301 


109625 


F03918 


H29490 


Hs.22697 


428376 


F04258 


AF1 19665 


Hs.184011 


109648 


F04600 


H17800 


Hs.7154 


109671 


F08998 


R59210 


Hs.26634 


109699 


F09605 


H18013 


Hs.167483 


109820 


F11115 


AW016809 


Hs.1 19021 


109933 


H06371 


R52417 


Hs.20945 


110014 


H10995 


AL10g666 


Hs.7242 


110039 


H11938 


H1ig38 


Hs.21907 


110099 H16568 


R44557 


Hs.23748 


110107 


H16772 


AW151660 Hs.31444 


110155 


H18951 


AI559626 


Hs.93522 


110197 


H20859 


AW090386 Hs.1 12278 


110223 


H23747 


H19836 


Hs.31697 


110306 


H38087 


H38087 


Hs.105509 


110335 


H40331 


H65490 


Hs.18845 


110342 


H40567 


H40961 


Hs.33008 


110395 


H46g66 


M025116 


Hs.33333 


110511 


H56640 


H56640 


Hs^1460 


110523 


H57154 


At040384 


Hs.19102 


110715 


H96712 


H96712 


Hs.269029 


110754 


N20814 


AW302200 Hs.6336 


428454 


N25249 


U55936 


Hs.184376 


431663 


N27100 


NIVL016569HS.267182 


134263 


N39616 


AW973443 Hs.8086 


110938 


N48982 


N48982 


Hs.38034 


110983 


N51957 


NM_015367Hs,10267 


111081 


N59435 


Ai14634g 


Hs^1614 


111128 


N64139 


AW505364 Hs.19074 


431548 


N86981 


AI834273 


Hs^711 


111216 N68640 


AW139408 HS.1S2940 


437S62 


N69352 


AB001636 


Hs^3 



cytochrome c oxidase subunit Vllc 
gb:zm97f08.s1 Stratagene colon HT29 (937 
EST 

gb:zn08g12^1 Stratagene hNT neuron (937 
gb2n06909^1 Strategene hNT neuron (937 
gb2ni26o06Al Stratagene pancreas (93720 
ESTs 

stress-assodated endoplasmic reticulum 
H.2K binding facbr^ 
stanniocatdn 2 
ESTs 

hypotheticai protein FLI20644 
ESTs 

ESTs. Wealdy similar to AF 174605 1 F-box 

DKFZP586G1517 protein 

anOlin (Orosophlla Scraps homoiog). act 

high-mobility group (nonhistone chromoso 

hypothetical protein FU 10569 

gb2o09e06^1 Stratagene neuroepithelium 

ESTs 

EST 

hypothetical protein FU20992 similar to 
hypothetical protein FLJ13397 
ESTs 

ubinudetn 1 

ESTs 

ESTs 

hypothetical protein FLJ10893 
KtM0530 protein 

Homo sapiens, done MGC:5564» mRNA. oomp 

EST 

EST 

endotheiin 1 
A4STP033 protein 
similar to rat myomegaiin 
ESTs 

hypothetical protein FU21016 

Homo sapiens cDNA: FLJ21869 fis, done H 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit ( 

ESTs 

ESTs 

ESTs 

pyrophosphatase (inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full iengtii insert cDN 

histone aoefyltransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIM1647 protein, 

arrestin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. Weakly similar to organic anion tr 
ESTs 

KIM0672 gene product 
synaptosomal^ssociated protein. 23kD 
T6X3^oprotetn 

Rl^ (guanine-7-) methyltransferase 
Homo sapiens cDNA FU12924 fis, done NT 
MIL1 protein 
C6M 12 protein 

l^TS (large tumor suppressor. Drosopiilia 

novel protein 

ESTs 

OEAD/H (Asp<Blu^-Asp/His) box polypep 
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mo99 


ROOloo 


AW270776 Hs.18857 


ESTs 


nioi4 


R07998 


R07998 




gb:yf16g11.s1 Scares fetal liver spleen 




KUo949 


BE267033 


Hs.192853 


ubiquitin-Gonjugafing enzyme E2G 2 (homo 


111574 


R10307 


AI024145 


Hs.188526 


ESTs 


i11tiU4 


Ko3o54 


AA482478 


Hs.181785 


ESTs 


illool 


RooQo3 


R36095 


Hs.268695 


ESTs 


426773 


R37938 


NM„015556Hs.172180 


KIAA0440 protein 


111SU4 


R393jU 


241572 




gD:HSCZYB122 nomialized infant brain cDN 


4zoa71 


K40o1O 


AB012193 


Hs.183874 


culiin4A 


44<)noo 


Qil04C0 


R49031 


Hs.22627 


ESTs 


1 00987 


R45698 


6^13269 


Hs.21893 


hypothetical protein DKFZp761N0624 


112300 


R54554 


H24334 


Hs^6125 


ESTs 


iizdi3 


R00425 


R68425 


Hs.13809 


hypomeocal protein Fu10648 


112514 


R6856B 


R68566 


Hs.183373 


src homology 3 domain-containbig protein 


112522 


R68763 


R68857 


Hs.265499 


ESTs 


112540 


R70467 


R69751 




gb:yi40a10^1 Soares placenta Nb2HP Homo 


428655 


R73565 


H05769 


Hs.188757 


Homo sapiens, done MGC:S564, mRNA, comp 


129534 


R73640 


AK002126 


Hs.11260 


hypothetical protein FLJ11264 


112597 


R78376 


R78376 


Hs.29733 


EST 


112732 


R92453 


R92453 


Hs.34590 


ESTs 


451 79o 


103855 


BE297567 


Hs.27047 


hypothetical protein FU20392 


112868 


T03872 


AW195317 Hs.107716 


hypothetical protein FU22344 


131863 


T10072 


AI656378 


Hs.33461 


ESTs 


112911 


T10080 


AW732747 Hs,13493 


lilce mouse brain protein E46 


132215 


T10132 


AL035703 


Hs.4236 


K1AA0478 gene product 


112931 


T15343 


T02966 


Hs.167428 


ESTs 


112984 


T23457 


T16971 


Hs.289014 


ESTs. Weakly similar to A43932 mucin 2 p 


112998 


T23555 


H11257 


Hs.22968 


Homo sapiens done IMAGE:451939, mRNA se 


133376 


T23670 


BE618768 


Hs.7232 


acetyl-Coenzyme A carboxylase alpha 


113026 


T23948 


AA376654 




euKaryotic translation initiation fector 


113070 


T33464 


AB032977 


Hs.6298 


KIAA1151 protein 


410781 


T34413 


AI375672 


Hs.165028 


ESTs 


113074 


T34611 


AK001335 


Hs^1137 


protein tyrosine phosphatase, receptor t 


113095 


T40920 


AA828380 


Hs.126733 


ESTs 


113179 


T55182 


BE622021 


Hs.152571 


ESTs. Highly simOarto IGF-II mRNA-bind 


113337 


T77453 


T77453 


HS.3Q2234 


ESTs 


113421 


T84039 


AI769400 


Hs.189729 


ESTs 


1 13454 


T86458 


A1022166 


Hs.16188 


ESTs 


113481 


T87693 


T87693 


Hs.204327 


EST 


453345 


T89350 


AA302862 


Hs.90063 


neurocatdn delta 


113557 


T90945 


H66470 


Hs.16004 


ESTs 


1 13559 


T90987 


T79763 


Hs.14514 


ESTs 


1 13589 


T91863 


AI078554 


Hs.15682 


ESTs 


1 13591 


T91881 


T91881 


Hs.2a0597 


KIAA0563 gene product 


113619 


T93783 


R08666 


Hs.17244 


hypothetical protein FLJ13605 


113683 


T96687 


AB035335 


Hs.144519 


T-celi ieukemia/iymphoma 6 


113692 


T96944 


AL360143 


Hs.17936 


DKFZP434H132 protein 


113702 


T97307 


T97307 




gb:ye53h05^1 Soares fetal liver spleen 


113717 


T97764 


T99513 


Hs.187447 


ESTs 


113824 


W48817 


A(631964 


Hs.34447 


ESTs 


113840 


W58343 


R72137 


Hs.7949 


DKFZP586B2420 protein 


113844 


W59949 


AI369275 


Hs.243010 


Homo sapiens cOlsIA FLJ14445 fis, done HE 


113902 


W74644 


AA340111 


Hs.100009 


acyl-Coenzyme A oxidase 1. palmitoyi 


113904 


W74761 


AF125044 


Hs.19196 


ubiqultin-conjugating enzyme HBUCE1 


113905 


W74802 


R81733 


Hs^3106 


ESTs 


113931 


W81205 


BE255499 


K5.3496 


hypothetical protein MGC15749 


113932 W81237 


AA256444 


Hs.126485 


hypothetical protein FU12604; K1AA1692 


131965 


W90146 


W79283 


Hs.35962 


ESTs 


114035 


W92798 


W92798 


Hs.269181 


ESTs 


114106 


238412 


AW602528 




gb:RC5-BT0562-260100-011-A02 BT0562 Homo 


457308 


Z38709 


AI416988 


Hs.238272 


inositol 1,4,5-triphosphate receptor, ty 


114161 


Z38904 


BE548222 


Hs^9883 


hypothetical protein FiJ23399 


424949 


Z39103 


AF052212 


Hs.1 53934 


core-binding factor, mnt domain, alpha 


457548 


Z39930 


AW069534 Hs.279583 


CGI-81 protein 


128937 


Z39939 


AA251380 


Hs.10726 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


432554 


Z40012 


AI479813 


Hs^78411 


NCK-assodated protein 1 


114277 


Z40377 


AI052229 


Hs.25373 


ESTs, Weakly similar to T20410 hypotheti 


114304 


Z40820 


AI934204 


Hs.16129 


CO IS 


114364 


Z41680 


AL1 17427 


Hs.172778 


Homo sapiens mRNA; cDNA DKFZp566P013 (fr 


432620 


AA005112 


AA777749 


Hs.5978 


LiM domain only 7 


129034 


AA005432 


AA481157 


Hs.108110 


DKFZP547E21 10 protein 


131881 


AA010163 


AW361018 Hs.3383 


upstream regulatory element binding prot 


332421 


AA026356 


A1909968 


Hs.108106 


transcription factor 


114465 


AA026901 


BE621056 


Hs.131731 


hypothetical protein FU11099 


451271 


AA036867 


AK001644 


HS26156 


hypothetical protein FU10782 


332498 AA044644 


AA303661 




lymphocyte-spedftc protein 1 
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431555 AA046426 
132944 AA054515 
114618 AA084162 
332509 AA0a5749 
S 114648 M101056 
1146S8 AA102746 
132456 AA114250 
450847 AA126561 
132225 AA128980 
10 437197 AA129757 
114709 AA129921 
458926 AA133331 
114750 AA135958 
426806 AA136524 
15 114763 AA147044 
114767 M148885 
114774 AA150043 
129388 AA151621 
457742 AA155743 
20 456200 AA156335 
130207 AA1 56336 
114798 AA159181 
114800 AA159825 
114828 AA234185 
25 114846 AA234929 
114848 AA234935 
114902 AA236359 
. 132271 AA236466 
114907 AA236535 
30 420170 AA236935 
132204 AA236942 
114928 AA237018 
132481 AA237025 
114932 AA242751 
35 314162 AA242760 
131006 AA242763 
114935 AA242809 
408908 AA243133 
437754 AA243495 
40 114957 AA243706 
114974 AA250848 
114977 AA250868 
114995 AA251152 
115005 AA251544 
45 417177 AA251792 
115026 AA252144 
115045 AA252524 
115068 AA253461 
133138 AA255522 
50 332668 AA255522 
115114 AA256468 
129584 AA256528 
115137 AA257976 
417187 AA258296 
55 115166 AA25840g 
115167 AA2S8421 
436719 AA262077 
115239 AA276650 
115243 AA278766 
60 428419 AA280791 
115322 AA280819 
413303 AA280828 
115372 AA282195 
409962 AA283127 
65 130269 AA284694 
456570 AA291137 
332675 AA291708 
407864 AA293495 
115536 AA347193 
70 408799 AA398474 
115575 AA398512 
115601 AA400277 
434428 AA400696 
115683 AA410345 
75 115715 AA416733 
132952 AA425154 



A1815470 Hs.260024 
T96641 Hs.6127 
AW979261 Hs^91993 
AA128376 M5.153884 
AA101058 

AA102383 Hs^49190 
AB011084 Hs.48924 
NM-003155HS.25590 
AA128980 
W38586 

AA397651 Hs.301959 
AB018284 Hs.158688 
AA887211 Hs.129467 
T19228 Hs.172572 
AA810755 Hs.102500 
AI859865 Hs.154443 
AV656017 Hs.184325 
AA662477 Hs.1 10964 
BE561824 Hs.273369 
AA768242 Hs.80618 
AF044209 Hs.144904 
AA159181 Hs.54900 
Z19448 Hs.131887 
AA252937 Hs.283522 
BE018682 H5.166196 
BE614347 Hs.169615 
AW275480 Hs.39504 
AB030034 Hs.1 15175 
N29390 Hs.13804 
U43374 Hs.95631 
AA235827 Hs.42265 
AA237018 Hs.94869 
W93378 Hs.49614 
AA971436 Hs.16218 
BE041820 Hs.38516 
AF064104 Hs.22116 
H23329 Hs.290880 
BE296227 Hs.250822 
R60366 Hs.5822 
AW170425 Hs.87680 
AW966931 Hs.302649 
AW296978 Hs.Sm? 
AA769266 Hs.193657 
A1760825 Hs.153042 
NM„004458Hs.81452 
AA251972 Hs.188718 
AW014549 Hs.58373 
AW512260 Hs.87767 
AV657594 Hs.181161 
AV657594 Hs.181161 
AA527548 Hs.7527 
AV656017 Hs.184325 
AW968304 Hs.56156 
AB011151 Hs.334659 
AF095727 Hs.287832 
AA749209 Hs.43728 
Y11192 Hs.5299 
BE251328 Hs.73291 
AA806600 Hs.116665 
U49436 

L08895 HS78995 
AW836130 Hs.75277 
AW014385 Hs.88678 
U82671 Hs.57698 
F05422 Hs.168352 
AA286914 Hs.183299 
6E439944 

AF069291 Hs.40539 
AK001468 Hs.62180 
AA059412 Hs.47986 
AA393254 Hs.43619 
AA148984 Hs.48849 
D14540 Hs.199160 
AF255910 Hs^eSO 
BE395161 Hs.1390 
A1658580 Hs.61426 



Cdc42 effector protein 3 

Homo sapiens cDNA: FU23020 fis. done L 

ESTs 

ATP binding protein associated with ceil 
gb2n2Sb03^1 Stratagene neuroepitheliutn 
tumor necrosb factor receptor superfaml 
KIAA0512 gene product; ALEX2 
stanniocaldn 1 

gb2o09a11^1 Stratagene neuroepHheOum 
guanine nucleotide binding protein (G pr 
proline synthetase co-transcribed (bacte 
K1AA0741 gene product 
ESTs 

hypothetical protein FU20093 
hypothetical protein dJ511E16.2 
jTunlchromosome maintenance deficient (S. 
CGI-76 protein 

hypothetical protein FU23471 

unc^aracterized hematopoietic stem/proge 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antQ 

ESTs. Weakly similar to T243S6 hypotheti 

Homo sapiens mRNA; cDNA DKFZp434J1912 (f 

ATPase. Class i. type 8B, member 1 

hypotheticai protein FU20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper c 

hypothetical protein dJ462Q23^ 

Human normal keratinocyte mFU^ 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, done MGC: 15887, mRNA, com 

CDC14 (ceU division cyde 14. S. cerevi 

ESTs. Weakly similar to ALU1.HUMAN ALU S 

serine/threonine kinase 15 

Homo sapiens cDNA: FU22120 fis. done H 

ESTs 

nudeosome assendbly prot^ 1-like 1 

ESTs 

ESTs 

ESTs 

fatty-add-Coenzyme A Itgase, king-Chain 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU14643 fis. done HT 
ESTs 

small fragment nuclease 
CGl-76 protein 
ESTs 

hypothetical protein t^C14139 
myelin protein zero-like 1 
hypothetk:al protein 

aldehyde dehydrogenase 5 family, member 
hypotheticai protein FU10881 
KIAA1842 protein 
KiAA1856 protein 

IVWDS box transcription enhancer factor 2 

hypothetical protein FLJ13910 

ESTs, Weakly similar to Unknown [H^apie 

TaigetCAT 

nudeoporin-like protein 1 

ESTs 

ESTs 

chromosome 8 open reading frame 1 
anOrm (DrosophOa Scraps homotog), act 
hypothetical protein 1^10940 
ESTs 

ESTs. Weakly simlter to ALU4_HUMAN AlOJ S 
myeloid/lymphoid or mixec^Iineage leukem 
juncQonai adheston molecule 2 
proteasome (prosome, macropain) subuna. 
Homo sapiens mesenchymal stem oeQ prote 
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115819 


AA426573 


AA486620 


Hs.41135 


409124 


AA43141R 


AW292809 Hs.50727 


115895 


AA43B1R9 


ABO00O35 


Hs.o1965 


458073 


fWt^l 1/99 


AA1 92669 


Hs.45032 


115962 


rVvf*rD«XlO 


AI6363o1 


Hs.179520 




AA44RAfi7 


AI745379 


Hs.42911 


1 15974 




BE513442 


Hs^8944 


115985 


AA4477fkQ 


AA447709 


H&268115 


129254 


AA453fi94 


AA2S2468 


Hs.1098 


44R7^n 


AA4<Kn^ 


BE384932 


Hs.64313 


116095 




AA043429 


Hs.62618 


426656 




R19768 


Hs.172788 


116210 


AA47RZQA 


BE622792 


Hs.172788 




AA^7R7'%A 
rvVttvt 00 


AA292105 


Hs.326740 


432645 




D14041 


Hs.347340 


1 lO^QQ 


MM4OZ0y0 


BE297412 


Hs.55189 




AA4ODU04 


AW157022 Hs.343551 






AI129767 


Hs.182874 






BE514376 


Hs.165998 


1 10 JO 1 




N41300 


Hs.71616 


1 IDOOO 


MA4y IZOU 


AF155827 


Hs.203963 




AAOUOl 00 


AA112748 


Hs.279905 


4 I0900 


A ACQO^yl7 

/v\0S044/ 


BE244323 


Hs.85951 


IIR'IQI 


A ACQQO/O 


T86558 


Hs.75113 


1 IOO«7*f 




NM_006033Hs.65370 




MMDUU J 00 


AI742845 


Hs.1 10713 


1 1R4.17 




AW4g9664 






AADUy/lU 


AF191018 


Hs.279923 




nADlUUOO 


AA251594 


Hs.43913 




A ACO'IQOO 


R80137 


Hs.302738 




A AC047CO 


AA361562 


Hs.178761 




|JZ102» 


AW449622 Hs.55200 




UlZlOU 


D12160 


Hs.249212 




ma7no 


AA114926 


Hs.169531 




n<ieQ/\4 


AA337548 


Hs.333402 


noo/ 1 


T^A CCCO 

U40D3Z 


D45652 


Hs,211604 


coo 




BE565817 


Hs.26498 




UoU0u4 


AJ224901 


Hs.10g526 


1 lOD'tO 


ruouiu 


A1367044 


Hs.153638 


1 IDDDl 


rU4Z4/ 


R61504 






r lUsOO 


AL1 17440 


Hs.170263 




no/uu 


BE549407 


Hs.115823 




nuouoo 


R52576 


Hs.285280 




n 10/00 


NM_000121Hs.89548 


1 ID/ f O 


ni /oio 


A1823410 


Hs.343581 


no/ou 


n^ooo 


H22566 


Hs.63931 


4000t>*t 


n404oy 


AA355925 


Hs.36232 


UDoiy 


uco/^7o 
nOoU/ 0 


H53073 


Hs.93698 




nooooy 


AL031428 


Hs.174174 




LIC7QC7 

no/yo/ 


AW955632 Hs.66666 




no4yoo 


H64938 


Hs.337434 


110040 


n04yro 


AA649530 


Hs.348148 




noyooo 


AI573283 


Hs.38458 




n/oiiu 


H73110 


Hs.260603 


11RQR1 


no 1/00 


N29218 


Hs.40290 


*I90 lOO 


noo&oy 


AC005757 


Hs.31809 


11/ UO 1 


nooooo 


H88353 


Hs.347265 


1 i/UOH 


nooooy 


U72209 




4011Z9 


tiooo/O 


AL137751 


Hs^63671 


41/oDl 


nyo/Uo 


AA334551 




11/ zou 


M00in7 
iMZiclU/ 


M18217 


Hs.172129 


11/ o44 


l>lz4U4o 


R19085 


Hs^1070S 


11/ 4^£ 


M'37non 


A1355562 


Hs.43880 


11/4/0 


IMOUiiUO 


N30205 


Hs.93740 


1 1/40/ 


Kf0nRO4 
lMOl/0£ 1 


N30821 


Hs.44203 


n/5>o/ 


NOO/OO 


AF044209 


Hs.144904 


1 ouzu/ 


MOOZOO 


AF044209 


Hs.144904 


117549 


N333d0 


N33390 


H5.44483 


117683 


N40180 


N40180 




117710 


N45198 


N45198 


Hs.47248 


117791 


N46325 


N48325 


Hs.93956 


117822 


N48913 


AA706282 


Hs.93963 


422544 


N49394 


AB018259 


Hs.118140 


117895 


N50656 


AW450348 Hs.93998 


452259 


N50721 


AA317439 


Hs.28707 


133057 


N53143 


AA46S131 


Hs.64001 



endomudn-2 

N-acetylglucosaminidase, alpha- (Sanfili 

K1AA1209 protein 

ESTs 

hypotfiefical protein MGC10702 
ESTs 

hypothetical protein FU10631 

ESTs, Weakly similar to T08599 piol)able 

DKFZp434J1813 protein 

ESTs. Weakly similartD AF257182 1 Gixo 

ESTs 

ALEX3pfDtein 
ALEX3 protein 

hypothettoal protein h46C10947 

i^*2K binding factor-2 

hypothetical protein 

hypothetical protein FLJ22584 

guanine nucleotide binding protein (G pr 

PAI-1 mRNA-binding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (f 

hypothetical protein FU10339 

done HQ0310PRO0310p1 

exportin, tRNA (nudear export receptor 

general transcription fador IIIA 

lipase, endothelial 

DEK oncogene (DMA binding) 

IHuman clone 23826 mRNA sequence 

putative nudeotide binding protein, est 

P1BF1 gene produd 

Homo sapiens cDNA: FU21425 fis, done C 
26S proteasome-assodated pad1 homolog 
ESTs 

polymerase (RNA) 111 (DNA directed) (155 
ESTs 

hypothetk»l protein M6C12760 
gb:HUMGS02848 Human adult lung 3' diied 
hypothetical proteh FU21657 
zinc finger protein 198 
myeloid/lymphoid or mixed-lineage ieukem 
gb:yh16a03.s1 Soares infant brain INIB H 
tumor protein p53-binding protein. 1 
nljonudease P, 40kD subunit 
Homo sapiens cONA: FU22096 fis. done H 
erythropoietin receptor 
karyopherin alpha 1 (impoitin alpha 5) 
ESTs 

K1AA0186 gene product 
EST 

KIAA0601 protein 

ESTs, Weakly similar to S19560 pmline-r 
ESTs. Weakly similar to A46010 X-linked 
gb:ns44f05^1 NCLCGAP_Alv1 Homo sapiens 
ESTs 

ESTs. Moderately similarto A47582 6-cel 
ESTs 

hypotiietical prote&i 

gb:yw21a02^1 Morton Fetal Cochlea Homo 

YYI-assodated factor 2 

Homo sapiens mRNA; cDNA DKFZp434IOB12 (f 

spemi spedfic antigen 2 

Homo sapiens cDNA: FIJ21409 fis, done C 

Homo sapiens cDNA FU13182 fis. done NT 

ESTs, Weakly similar to A46010 X-finked 

ESTs. Weakly ^Oar to 138022 hypotheti 

ESTs 

nudear receptor co-repr^or 1 
nudear receptor co-repressor 1 
EST 

gb:yy44d02^1 Soares_multiple.sderosis_ 
ESTs, Highly sMarto similarto Cdc14 
EST 
ESTs 

KlAA0716gene produd 
ESTs. Highly similarto SORL.HUMAN SORTI 
signal sequence receptor, gamma (transk) 
Homo sapiens done 25218 mRNA sequence 
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118103 N55326 
118111 N55493 
118129 N57493 
118278 N62955 
S 118329 N83520 
118336 N63604 
417098 N64166 

118363 N64168 

118364 N64191 
10 118475 N66845 

118491 N67135 
118500 N67295 
118584 N68953 
456647 N69331 
15 118661 N70777 
118684 N71364 

118689 N71545 

118690 N71571 
118766 N74456 

20 118793 N75594 
118817 N79035 
118844 N80279 
118919 N91797 
129558 N92454 

25 407604 N94581- 
118996 N94746 
119021 N98238 
119039 R02384 
119083 R16833 

30 332622 R41828 
119111 R43203 
415115 R46395 
119146 R58863 
449224 R78248 

35 119239 T11483 
119281 T16896 
119298 T23820 
126502 T30222 
419983 W15275 

40 119558 W38194 
429641 W42414 
419445 W49632 
119650 W57613 
119654 W57759 

45 119683 W61118 
119894 W65344 
119718 W69216 
410365 W69379 
119938 W86728 

50 120128 738499 
120130 Z38630 
120148 Z39494 
120155 Z39623 
451979 Z40071 

55 120183 Z40174 
120184 Z40182 
120211 Z40904 
120245 AA166955 
120247 AA167500 
60 120254 AA1 69599 

120259 AA171724 

120260 AA171739 
120275 AA177105 
120284 AA182626 

65 417735 AA186324 
422137 AA192099 

120302 AA192173 

120303 AA192415 
120305 AA192553 

70 120319 AA194851 
408729 AA195520 

120326 AA196300 
133145 AA196549 

120327 AA196721 
75 120328 AA196979 

120340 AA208828 



AA401733 Hs.184134 

N554g3 

N57493 

N62955 Hs^16433 
N63520 

BE327311 Hs.47166 
AB017365 Hs.173859 
AI183838 H5.48938 
N46114 Hs^169 
N66845 

AV647908 Hs.90424 
W32889 Hs.154329 
AW136928 

AI252640 Hs.110364 
AL137554 Hs.49927 
N71313 Hs,163986 
AW390601 Hs.184544 
N71571 Hs.269142 
N74456 Hs.50499 
N75594 Hs.285921 
AI668658 Hs.50797 
AL035364 Hs.50891 
AW452696 Hs.130760 
AW580922 Hs.180446 
AW191962 Hs.288081 
N94746 Hs.274248 
N98238 Hs^5185 
AI160570 Hs^52097 
R16833 Ks.53106 
R10674 

TQ2865 Hs.328321 
AA214228 Hs.127751 
R58863 Hs.91815 
AW995911 Hs.299883 
T11483 

AI692322 Hs.65373 
NM_001241Hs.155478 
T10077 Hs.13453 
W55956 Hs.94030 
W38194 

AW081883 Hs.211578 
AA884471 Hs.90449 
R82342 HS79856 
W57759 

W65379 Hs.57835 
AA041350 Hs.57847 
W69216 Hs.92848 
AI287518 

AW014862 Hs.58885 
BE379320 Hs.91448 
AA045767 Hs.5300 
F02806 Hs.65765 
Z39623 Hs.65783 
F06972 Hs.27372 
AW082866 Hs.65882 
Z40182 Hs.65885 
Z40904 Hs.66012 
AW959615 Hs.111045 
AA1 67500 Hs.103939 
W90403 Hs.1 11054 
AW014786 Hs,192742 
AK000061 Hs,101590 
AA177105 Hs.78457 
AA179656 

AA188175 Hs.82506 
AJ236885 

AA837098 Hs.269933 
AI216292 Hs.96184 
AW295096 Hs.101337 
T57776 Hs.191094 
AA195764 Hs.72639 
AAig6300 Hs^1145 
H942Z7 Hs.6592 
AK000292 Hs.130732 
AA923278 Hs.290905 
AA206828 



ESTs 

gb7v50o02^1 Soares fetal liver spleen 
gb7y54c08.s1 Soares_mu!tiple_sderosis_ 
Homo sapiens cDNA FU11375 fis, done HE 
gb7y62f01.s1 Soares.muitipte.sdeiosis. 
HT021 

frizzled (Drosophitei) homolog 7 
hypothetical protein RJ21802 
hypothetical protein FU22623 
gb:za46c1 1 .si Soares fetal liver spleen 
Homo sapiens cDNA: FU23285 fis. done H 
ESTs 

gbAJI-H-BI1-adpKW)8^l^1 NCLCGAP.Su 

peptidylprolyl isomerase C (cydophiDn 

protein kinase NYD-SP15 

Homo sapiens cDNA: F1J22765 fis, done K 

Homo sapiens, done 1MAGE:3355383. mRNA, 

ESTs 

EST 

ESTs. Moderately similar to T47135 hypot 
ESTs 

hypothetical protein 

myosin phosphatase, taiget subunit 2 

karyopherin (imporlin) beta 1 

ooDagen. type VIII. alpha 2 

hypothetical protein FLJ20758 

ESTs 

pregnancy spedfic beta-1 -glycoprotein 6 
ESTs. Moderately similar to ALU 1.HUMAN A 
CSR1 protein 
EST 

hypothetical protein 
ESTs 

hypothetical protein FiJ23399 
gb:CHR90049 Chromosome 9 exon Homo sapie 
ESTs. Weakly similar to T02345 hypotheti 
cydin T2 

hypothetical protein FLJ14753 
Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
Empirically selected from AFFX single pr 
Homo sapiens cDNA: FLJ23037 fis, done L 
Human done 23908 mRNA sequence 
ESTs, Weakly similar to S65657 alpha-IC- 
gb:zd20g1 1 -s1 SoaresJetaLhearLNbHHIOW 
ESTs 

ESTs. Moderately similar to ICE4.HUMAN C 
ESTs 

Homo sapiens mRNA; cDNA DKFZpS86D0923 (f 
ESTs 

MKP-1 nke protein tyrosine phosphatase 
bladder cancer assodated protein 
ESTs 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

EST 

EST 

ESTs 

EST 

ESTs 

hypothetical protein FLJ12785 

hypothetical protein 

solute carrier family 25 (mitochondrial 

gb:zp54e11s1 Stratagene NT2 neuronal pr 

KtAA1254 protein 

zinc finger protein 148 (pHZ-52) 

ESTs 

ESTs 

uncoupling protein 3 (mitochondral, pro 

ESTs 

ESTs 

hypothetical protein RG083M05^ 

Homo sapiens, done IMAGE:2981368. mRNA. 

hypothetteal protein FLJ20285 

ESTs, Weakly similar to protease (Ksapl 

gb*^80b08^1 Stratagene hNT neuron (937 
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BNS0(XI0:<WO 



417122 


AA207123 




Ue fi19^ 


131522 


AA214S39 


At3Rnn4n 


Mq 93Q4Aq 


421787 






ns.iuooul 


120375 




AF028706 


Hs.1 11227 


120376 




AA227469 




12039D 




AA837093 


Hs.111460 


410804 


AA?^34 


U64820 


Hs.66521 


434223 




AI825842 


Hs.3776 


312771 


AA9W14 


AA018515 


Hs.264482 


1 20396 


AA9337Q6 


AA134006 


Hs.79306 


120409 


AA93'i050 


AA235050 




120414 




AW137156 Hs.181202 


120420 


AA9360'?1 


AI128114 


Hs.112885 


120422 


AAP'^fii*;? 


AL133097 


Hs.301717 


419326 


AA93fi3-Qn 


W94915 


Hs.42419 


120423 


AA93fi4«53 


AA236453 


Hs.18978 


120435 


AA94337n 


AA243370 


Hs.96450 


120453 




AA250947 


Hs.170263 


120455 


AA9'^in«3 


AA251720 


Hs.104347 


120456 


AAP'illl^ 
rv\^9 1 1 1 0 


AA488750 


Hs.88414 


120473 


AA9*i1Q73 


AA251973 


Hs.2699d8 


128922 


AA9'\9n93 


AI244901 


Hs,9589 


1 20477 


AA9S9414 


AA252414 


Hs.43141 


120479 


AA9»^9R'^n 


AF006689 


Hs.1 10299 


120488 


AA255523 


AW952916 Hs.63510 


120510 


AA95812& 


AI796395 


Hs,1 11377 


120527 




AA262105 


Hs.4094 


120528 




A1923511 


Hs.104413 






Ai434823 


Hs.104415 


1^541 


AA97fi9Ckn 


W07318 


HS^40 


1 &U3*t*l 


AA97ft791 


BE548277 


Hs.103104 




HMiTOUUOD 


BE244580 


Hs.342307 






AA807544 


Hs.24970 


100*^71 




AB037744 


Hs.34892 


1 9fm79 
l£\JOi C 


A A 000704 


H39599 


Hs.294008 






AW967495 Hs.186644 


1 oU9^9 


A AOonooc 


AA178953 


Hs.309648 






AW978022 Hs.238911 






AB020686 


Hs.54037 


120591 


AA9ft17Q7 


AF078847 


Hs.191356 


120593 


AA9fl9n47 


AA748355 


Hs.193522 






Z11773 


Hs.237786 




AA9A37nQ 


AA306166 


Hs.7145 




A A 900000 


AW978721 


Hs.266076 


409702 


AA9A4inA 


AI752244 






A A 004 4 no 


AI241084 


Hs.154353 






AA284371 


Hs.118064 






AA1 15496 


H5.336898 






BE617656 


Hs.99756 




A A 9040^ 


AW961294 Hs.143818 






AA286844 


Hs.61260 




AA9ft7n'?9 


AW172431 Hs.13012 




MMZOf UOO 


A1889129 


Hs.96616 


1 9nRfin 


AA9A7*^R 


AA286785 


Hs.99677 




AAOoyccq 


BE622187 


Hs.99670 


120651 


AA9fi7*^*^fi 


AA287556 


Hs.263412 


429828 


AA9R7«%fi4 


AB019494 


Hs.225767 


452291 


AA9Q101*! 
/v\t*> lulu 


AF015592 


Hs.28853 


1 9nRQQ 


AA9Q17lfi 
AMZ9 1 / ID 


AI683243 


Hs.97258 


inoRQn 

IUU09U 


AA9Q174Q 


AA383256 


Hs.1657 




AA9Q3R^R 


AA293655 


H5.21198 


1 907*^7 


AA3n94'^n 


AL049176 


Hs.82223 




AA?n9AnQ 
rVMOU^OUs 


AA302809 






AA3n9flpn 


U83993 


Hs.321709 


120750 


AA3in4<W 


AI191410 


Hs.96693 


120761 




AA321890 




120768 


AA3405a9 


AA340589 


Hs.104560 


120769 


AA340622 


AI769467 


Hs.9475 


135232 


AA342457 


AL038812 


Hs.96800 


120793 


AA342864 


AA342864 


Hs.96812 


120796 


AA342973 


A1247358 


Hs.96820 


120809 


AA34649S 


AA346495 




332633 


AA347573 


AL120071 


Hs.48998 


120825 


AA347614 


AI280215 


Hs.96885 


120827 


AA347717 


AA382525 


Hs.132967 


120839 


AA348913 


AA348913 





immunoglobulin superfamlly, member 3 
TIA1 cytotoxic granule-associated RNA-bl 
nudear receptor subfamily 2, group C, m 
Zic family member 3 (odd-paired Drosophi 
gb:zr18a07.s1 Stratagene NT2 neuronal pr 
caldum/calmoduiin-dependent protein kin 
Machad(K)oseph disease (spinocerebellar 
zinc finger protein 216 

Homo sapiens mRNA; cDNA DKF2:p761 A041 1 (f 
eukaryotic translatton initiation fector 
gb:zs38e04.s1 Soares_NhHMPu_S1 Homosapi 
hypothetical protein FLI10038 
spinal cord-derived growth fector-B 
hypotheticai protein DKFZp434N192d 
ESTs 

Homo sapiens cDNA: FLJ22822 fis, done K 
EST 

tumor protein p53-binding protein, 1 
ESTs, Weakly simiJartoALUC^HUMAN JIJ! 
BTB and CMC homok^y 1. bask: leucine zi 
ESTs 
ubiquilin 1 

DKFZP727C091 protein 
mitogen-activated protein kinase kinase 
KIAA0141 gene product 
ESTs 

Homo sapiens cDNA FU14208 fis. clone NT 

ESTs 

ESTs 

M-phase phosphoprotein 1 
ESTs 

hypoflietical protein FU10330 

ESTs. Weakly similar to B34323 GTP-bindi 

KIAA1323 protein 

ESTs 

ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 H 
hypothetical protein DKFZp762E1511: KIAA 
ectonudeotide pyrophosphatase/phosphodt 
general transcription factor IIH, poiype 
ESTs 

zinc finger protein 187 
calpain 7 

ESTs. Weakly similar to A46010X-linked 

eukaryotic translation elongation factor 

nonselective sodium potassium/proton exc 

similar to rat nuclear ubiquitous casein 

Homo sapiens, Similar to RIKEN cDNA 1810 

mitochondffal ribosome recyding factor 

hypothetical protein FU23459 

hypothetteal protein FU13164 

ESTs 

ESTs 

ESTs 

ESTs, Weakly simllarto 138022 hypotheti 
ESTs. Weakly simDarto ALUB.HUI^N Ull 
1DN3 protein 

CDC7 (cell division cyde 7, S. cerevisi 
ESTs. Moderately similar to S29539 nlios 
estrogen receptor 1 
ESTs 

chordin-like 

gb:EST10426 Adipose tissue, white I Homo 
purinergtc receptor P2X, figand-gated io 
ESTs. Moderately similar to 2109260A B c 
branched chain keto add dehydrogenase E 
EST 
ESTs 

ESTs. Moderately similar to ALU7.HUMAN A 

ESTs 

ESTs 

gb£ST52657 Fetal heart (I Homo sapiens 
fibronectin leudne rk;h transmembrane p 
ESTs 

Human EST done 122887 mariner transposo 
gb£ST55442 Infiant adrenal gland il Homo 
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1208S0 AA349847 
120852 AA349773 
128852 AA350541 
135240 AA357159 
5 120870 AA357172 
120894 AA370132 
435737 AA370472 
120897 M370867 
120915 AA377296 
10 120935 AA383902 
120938 AA385934 

120937 AA386255 

120938 AA386260 
417632 AA386266 

15 120960 AA398014 
120985 AA398222 
120988 AA398235 
121008 AA398348 
121029 AA398482 

20 121032 AA398504 

121033 AA398505 

121034 AA398507 

121035 AA398523 
121058 AA398625 

25 121060 AA398632 
121061 AA398633 

121091 AA398894 

121092 AA398895 
121094 AA398900 

30 121096 AA398904 
121115 AA399122 

121121 AA399371 

121122 AA399373 
121125 AA399441 

35 121151 AA399636 
121153 AA399640 
121163 AA399680 
121176 AA400080 
121192 AA400262 

40 121223 AA400725 
121227 AA400748 
121231 AA400780 

121278 AA401631 

121279 AA401688 
45 121282 AA401695 

121299 AA402227 

121301 AA402329 

121302 AA402398 
121304 AA402449 

50 121305 AM02468 
134721 AA403268 

121323 AA403314 

121324 AA404229 
444422 AA404260 

55 131074 AA404271 
121344 AA405026 
121348 AA405182 
121350 AA405237 
121400 AA406061 

60 121402 AA406063 
121403 AA406O7O 
121408 AA406137 
121431 AA406335 
121471 AA411804 

65 121474 AA411833 
121526 AA412219 
121530 AA412259 

121558 AA412497 

121559 AA412498 
70 121584 AA416586 

121609 AA416867 
121612 AA416874 
121737 AA421133 
121740 AA421138 
75 436032 AA422079 
121784 AA423B37 



AA349647 Hs.98927 
AA349773 Hs.191564 
R40622 Hs.106601 
AA357159 Hs.96988 
AA357172 Hs.292581 
AA370132 Hs.97063 
AF229839 Hs.173202 
AA370867 Hs.97079 
AL135556 Hs.97104 
AL048409 Hs.97177 
AA385934 Hs,97184 
AA386255 Hs.97186 
AA386260 Hs.104632 
R20855 Hs.5422 
AA398014 Hs.104684 
AI219896 Hs.97592 
AA398235 Hs.97631 
AA398348 Hs.1 30546 
AA398482 Hs.97641 
AA393037 Hs.161798 
AA398505 Hs.97360 
AL389951 Hs.271623 
AA398523 Hs.210579 
AA398625 Hs.97391 
AA398632 Hs.97395 
AA393288 Hs.97396 
AA398894 HsS7657 
AA398895 Hs.97658 
AA402505 

AA398904 Hs.332690 
AA3g8187 Hs.104682 
AA399371 Hs.l89095 
A1126713 Hs.192233 
AL042981 Hs.251278 
AA399636 Hs.143629 
AA399640 Hs.97694 
AI676062 Hs.1 11902 
AL121523 Hs.97774 
AA400262 Hs.190093 
A1002110 Hs.97169 
AA400748 Hs.97823 
AA814948 Hs.96343 
AA037121 Hs.98518 
AA292873 Hs.177996 
AA401695 Hs.97334 
AA402227 Hs.22826 
NIVL006202HS.89901 
AA402587 Hs.325520 
AA293863 Hs.97316 
AA402468 Hs291557 
AK000112 Hs.89306 
AA291411 Hs.97247 
AA404229 Hs.97842 
A1768623 Hs.108264 
U16125 Hs.181581 
AA405026 Hs.193754 
AA405182 Hs.97973 
AA405237 

AA406051 H3.98001 
AA406063 Hs.98003 
AA406070 Hs.98004 
AA406137 Hs.98019 
AA035279 Hs.176731 
AA411804 Hs.261575 
AA402335 Hs.1 88760 
AW665325 Hs.98120 
AA778658 Hs.98122 
AA412497 

AI192044 Hs.104778 
A1024471 Hs.98232 
AA416867 Hs.98185 
AA416874 Hs.98168 
AA421133 Hs.104671 
AA421138 Hs.143835 
AA150797 Hs.109276 
T90789 Hs.94308 



Homo sapiens cONA FU12573 fis, done NT 

ESTs 

ESTs 

EST 

ESTs, Moderately simfiar to ALUI.HUMAN A 
ESTs 

)-kappa>B-(nteracting Ras-Dke protein 1 
ESTs, Moderately simitartoAF174605 1 F 
ESTs 

ESTs, Weakly similar to ALU1.HUMAN ALU S 
EST, HIglity simitar to (defiine not avai 
EST 
EST 

glycoprotein M6B 

EST 

ESTs 

ESTs 

Human DMA sequence from done RP11-251J8 

EST 

ESTs 

ESTs 

nudeoporin 50i(D 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. l^derateiy similar to ALU8_HUMAN A 
EST 

gb:zt62h10.r1 SoaresJtestis.NHT Homo sap 
ESTs 

ESTs, Weakly similar to mitochondrial d 

similar to SALL1 (sal (DrosophDaHike 

ESTs. Highly stmflar to T00337 hypotheti 

KIAA1201 protein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. Weakly similar to dJ667H12.2.1 [H, 
Homo sapiens mRNA; cDNA DKFZp434D024 (fr 
ESTs. Weakly simiter to ALUC^HUMAN 111! 
Homo sapiens cDNA FU11490 fis, done HE 
ESTs 
ESTs 

tropomodulin 3 (ubiquitous) 

phosphodiesterase 4A. cAMP-spedfk: (dun 

LATI^TM protein 

EST 

ESTs 

hypothetical protein FU20105 

ESTs 

EST 

ESTs 

glutamate receptor, ionotropic kainate 

ESTs 

ESTs 

gbiztOOelO^I NCLCGAP.GCB1 Homo sapiens 

EST 

ESTs 

EST 

EST 

ESTs 

ESTs 

ESTs. Highly similar to Trad (H^piens] 

ESTs 

ESTs 

gb.zt95g12£l Soares.testis JJHT Homo sap 

ESTs 

ESTs 

EST 

ESTs 

erythrocyte transmembrane protein 
EST 

tatexin protein 

RAB35, member RAS oncogene family 
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1216Q2 
121803 
135286 
332778 
121806 
129517 
121845 
121853 
121891 
121895 
121899 
121917 
121918 
121919 
121941 
121942 
121970 
121993 
418706 
122022 
122050 
122051 
122055 
122105 
* 122125 
135235 
122162 
422072 
415106 
122186 
122235 
412970 
419288 
122310 
122334 
122382 
122425 
122431 
122450 
426284 
122477 
122500 
122522 
122536 
122538 
122540 
122560 
421919 
122562 
122585 
122608 
122635 
122636 
122653 
122660 
122703 
122724 
122749 
122772 
430242 
429838 
122777 
135362 
122798 
122837 
122860 
122861 
122910 
132899 
122967 
422845 
123009 
128917 
123081 
123133 
123184 



AA42432d 
AA424339 
AA424469 
AA424469 
AA424502 
AA425004 
AA425734 
AA425887 
AA426456 
AA427396 
AA427555 
AA428218 
AA428242 
AA428281 
AA428865 
AA428994 
AA429666 
AA430181 
AA430184 
AA431293 
AA431478 
AA431492 
AA431732 
AA432278 
AA434411 
AA435512 
AA4356g8 
AA435711 
AA435815 
AA435842 
AA436475 
AA436489 
AA442060 
AA442079 
AA443151 
AA446133 
AA447145 
AA447398 
AA447643 
AA447742 
AA448226 
AA448825 
AA449444 
AA450087 
AA450211 
AA450244 
AA452123 
AA452155 
AA452156 
AA453036 
AA453526 
AA454085 
AA454103 
AA454642 
AA454935 
AA456323 
AA457395 
AA458850 
AA459662 
AA459668 
AA459679 
AA459702 
AA460017 
AA460324 
AA461509 
AA464414 
AA464428 
AA470084 
AA476606 
AA478521 
AA478523 
AA479949 
AA4812S2 
AA485351 
AA487264 
AA489072 



AI251870 Hs.188d98 ESTs 

AI338371 Hs.157173 ESTs 

AW023482 Hs.97849 ESTs 

AW023482 Hs.97849 ESTs 

AA424313 H5.98402 ESTs 

AW972853 Hs.112237 ESTs 

AI732692 Hs.165066 ESTs. Moderately simflarto ALU2 HUMAN A 

AA425887 Hs.98502 hypothetical protein FU14303 

AA426456 Hs.98469 ESTs 

AA427396 gb:2gv33a02.s1 Scares ovary tumor NljHOT H 

R55341 Hs.50421 K)AA0203genepjDduct 

AA406397 Hs.139425 ESTs 

BE274689 Hs.184175 chromosome 2 open reading frame 3 

AA428281 Hs.98560 EST 

AA428865 Hs.98563 ESTs 

AW452701 Hs.293237 ESTs 

AA429666 Hs.98617 EST 

AW297880 Hs.98661 ESTs 

U73524 Hs.87465 ATP/GTP-blnding protein 

AA431293 Hs.98716 ESTs, Moderately similar to T42650 hypot 

AI453076 ELAV (embryonic lethal, abnomial vision, 

AA431492 Hs.98742 EST 

AA431732 Hs.98747 EST 

AW241685 Hs.98699 ESTs 

AK000492 Hs.98806 hypothetical protein 

AW298244 Hs266195 ESTs 

AA628233 Hs.79946 cytochrome P450, subfamily XIX (aromatiz 

AB018255 Hs.111138 KIAA0712 gene product 

U40763 Hs.77965 peptidyl-prolyl Isomerase G (cyclophilin 

AA398811 Hs.104673 ESTs 

AA436475 Hs.1 12227 membrane-assodated nucleic add binding 

AB026436 Hs.177534 dual specHidty phosphatase 10 

AA256106 Hs.87507 ESTs 

AW192803 Hs.98974 ESTs, Weakly similar to S65824 reverse t 

BE465894 Hs.98365 ESTs, Weakly similar to LB4DJiUMAN NADP- 

AA446440 Hs.98643 ESTs 

AB007859 Hs.100955 KIAA0399 protein 

AA447398 Hs.99104 ESTs 

AA447643 Hs.112095 hypothetical protein DKFZp434F1819 

AJ404468 Hs.284259 dynein. axonemal, heavy polypeptide 9 

AA448226 Hs.324123 ESTs 

AA448825 Hs.99190 ESTs 

AA299607 Hs.98969 ESTs 

AF060877 Hs.99236 regulator of G-protein signalling 20 

AA450211 Hs.99239 ESTs 

AA476741 Hs.98279 ESTs. Weakly simUar to A43932 mucin 2 p 

AW392342 Hs.283077 centrosomal P4.1 -associated proieh; unc 

AJ224901 Hs.109526 zinc finger protein 198 

AA452156 gb:zx29c03.s1 SoaresJotaLfetus_Nb2HF8 

AI681654 Hs.170737 hypothelfcal protein FU23251 

AA4S3525 Hs.143077 ESTs 

AA454085 gb*^3a085l Soares.totaLfetus^Nb2HF8 

AW651706 Hs.99519 hypothetical pTOt^ FU14007 

AW009166 Hs.99376 ESTs 

AI816827 Hs.180069 nuclear lespiratoiy factor 1 

AA455323 Hs269369 ESTs 

AA457395 Hs.99457 ESTs 

AA458850 Hs.293372 ESTs. Weakly similar to B34087 hypolhetl 

AW1 17452 Hs.99489 ESTs 

U66669 Hs,236642 3-hydroxyisobutyTyl-Coen2yme A hydrolase 

AW904907 Hs.30732 hypothetical protein FU13409; KIAA1711 

AK001022 Hs.214397 hypothetical protein FU10160 similar to 

AA978128 Hs.99513 ESTs, Weakly similar to T17454diaphanou 

AW366286 Hs.145696 splidng factor (CC1.3) 

AA461509 Hs.293565 ESTs. Weakly similar to putative pi 50 [H 

AA464414 gb:2x78g01.s1 Soares ovaof tomor NbHOT H 

AA335721 Hs.213628 ESTs 

AA4700M Hs.98358 ESTs 

AA476606 Hs.59666 SMAD in the antisense orientatnn 

AA806187 Hs.289101 glucose regulated protein. 58kD 

AA317841 Hs.7845 hypothetical protein MGC2752 

AA535244 Hs.78305 (^2. member RAS oncogene family 

AI365215 Hs2a6097 oncogene TC21 

AI81S486. Hs^43901 Homo sapiens cDNAFU20738ris. done ^ 

AA487264 Hs.154974 Homo sapiens mRNA; cDNA DI^Zp667N064 (fr 

BE247767 Hs.18166 KIAA0870 protein 
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332467 AA489630 

123233 AA490225 

123234 AA490227 
123236 AA490255 
123255 AA490890 
430015 AA490916 
448892 AA490g25 
123259 AA490955 
123284 AA495612 
123286 AA495824 
123315 AA4gS369 
457397 AA504125 
433049 AA521473 
123421 AA598440 
123449 AA598899 
426981 AA599244 
409986 AA599694 
123497 AA600037 
123604 AA609135 
123712 AA609684 
123731 AA609839 
123800 AA620423 
123841 AA620747 
123929 AA621364 
123978 C20653 
133184 D20085 
132835 D20749 
435147 D51285 
128695 D59972 
124029 F04112 
124057 F13604 
449316 H01662 
130973 H05135 
124106 H12245 
124136 H22842 
124165 H30894 
429627 H43442 
124178 H459g6 
129948 H69281 
452114 H69485 
124-HD826254 
129056 H70627 
427580 H73260 
426793 H77531 
124274 H80552 
129076 H80737 
457658 H93412 
124315 H94892 
437712 H95643 
124324 H96552 
452933 H97146 
132231 H99131 
421877 H99462 
443123 H99837 
132983 N22140 
420473 N22197 
417381 N23756 
130365 N24134 
456610 N24195 
439311 N26739 
124383 N27098 
124387 N27637 
129341 N33090 
419793 N35967 
124433 N39069 
124441 N46441 
132338 N48270 
436575 N48365 
124466 N51316 
408048 N51499 

124483 N53976 

124484 N54157 

124485 N54300 
124494 N54831 
129200 NS9849 
124527 N62132 



NM.014700Hs,1 19004 
AW974175 Hs.151875 
NM_001938Hs.16697 
AW968504 Hs.123073 
AA830335 Hs.105273 
AW768399 H5.106357 
AF084535 Hs^2464 
AI744152 Hs.283374 
AA488968 Hs.293796 
AA495824 Hs.168822 
AA496369 

AW969Q25 Hs.109154 
AU076668 Hs.334884 
AA598440 Hs.291154 
AL049325 Hs.1 12493 
AL044675 Hs.173081 
NM»014777Hs.57730 
AA765256 Hs.135191 
AA609135 Hs.293076 
AA609684 

AA609839 Hs.334437 
AA620423 Hs.1 12862 
AA620747 Hs.1 12896 
AA621364 Hs.1 12981 
T89832 Hs.170278 
AA001021 Hs.6685 
283844 Hs.5790 
AL133731 Hs.4774 
NiVL003478Hs.101299 
F04112 Hs.312553 
AA902384 Hs.73853 
AI609045 Hs.321775 
At636416 Hs.1440 
H12245 

H22842 Hs.101770 
H30039 Hs.107674 
N^L015340Hs.2450 
BE463721 Hs.97101 
AI537162 Hs.263988 
N22687 Hs.8236 
H698g9 H6g899 
AI769958 Hs.1 08336 
AK001507 Hs.44143 
X89887 Hs.172350 
H80552 Hs.1 02249 
AI351010 Hs.102267 
AW952124 Hs.13094 
NM.005402HS.288757 
X04588 Hs.85844 
H96552 Hs.159472 
AW391423 Hs.288555 
AA662910 Hs.42635 
AW250380 Hs.109059 
AA094538 Hs.272808 
AA099693 Hs.34851 
AL1 18782 H5.300208 
AF164142 H5.82042 
W56119 HS.15S103 
AF172066 Hs.106346 
BE270668 Hs.151945 
N27098 HS.1Q2463 
N27637 Hs.109019 
AI193519 Hs.226396 
AI364933 Hs.168913 
AA280319 Hs.268840 
AW450481 Hs.161333 
AA353868 Hs.182982. 
A1473114 

R10084 Hs.1 13319 
NM_007203Hs.42322 
AI821780 Hs.179864 
H66118 Hs.285520 
AB040933 Hs.15420 
N54831 Hs.271381 
N59849 H5.13565 
N79264 Hs^1G4 



KIAA0665 gene product 

ESTs, Weakly similar to MAPB^HUMAN MICRO 

dowTHeguiator of transcription 1. TBP-b 

CDC2-Felated protein kinase 7 

ESTs 

ESTs 

epilepsy, progressive myoclonus type 2, 
ESTs. Weakly similar to CA15_HUMAN COLLA 
ESTs 

ESTs. Weakly similartDA46010 X4inked 
gb2v37d10^1 Soares ovary tumor NbHOT H 
ESTs 

SEC10 (S. oerevisiae)-fike 1 

EST, Weakly similar to 138022 hypofhefic 

Homo sapiens mRNA; cDNA DKFZp564D036 (fr 

KIAA0530 protein 

KIAAOl 33 gene product 

ESTs, Weakly similar to unnamed protein 

ESTs 

Homo sapiens cDNA: FU21543 fis, c/one C 

gb:ae62101.s1 Stratagene lung carcinoma 

EST 

ESTs 

ESTs 

ESTs 

thyroid hormone receptor interactor 8 

hypothetical protein dJ37E16.5 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

cu!Iin5 

gb:HSC2JH062 normalized infant bia'm cDN 
bone morphogenetic protein 2 
hypothetical protein DKFZp434D1428 
OEAO/H (Asp-GIu-Ala-Asp/Hts) box poiypep 
gb:ym17a12.r1 Soares infant brain 1NIB H 
EST 
ESTs 

leucyl-tRNA synthetase, mitochondrial 
putative G protein-coupled receptor 
ESTs 
ESTs 

gb:yu70cl2.s1 Weizmann Olfactory Epithet 
ESTs, Weakly simUar to ALUE.HU^MN llll 
Homo sapiens done FLB6914 PR01821 mRNA, 
HIR (histone cell cyde regulation defec 
EST 

lysosomal 

presenllins associated rhomboid-ltke pro 
v-ral simian leukemtei viral oncogene hom 
neurotrophic tyrosine kinase, receptor. 
Homo sapiens cDNA: FU22224 fts, done H 
Homo sapiens cONA: FIJ22425 fis. done H 
hypothetical protein DKF2p434K2435 
mitochondrial ribosomal protein LI 2 
putative transcriptbn regulation nudea 
epsiion-tubulin 

Sec23^teracting protein p125 
solute carrier fami^ 23 (nudeobase tra 
eukaryotk: translatton initiation factor 
retinok: add repressible protein 
mitochondrial ribosomal protein L43 
EST 
ESTs 

hypothetical protein FU11126 

serine/threonine kinase 24 (Ste20. yeast 

PR01575 protein 

ESTs 

goIgin-67 

ESTs 

kinesin heavy chain member 2 ' 
A kinase (PRKA) anchor proton 2 
ESTs 

ESTs. Wealdy similar to 2109260A B oeU 
KIAA1500 protein 

ESTs. Weakly stmOar to 138022 hypotheti 
Sam68-(ike phosphotyrosine protein. T-ST 
ESTs 
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124532 


N62375 


N62375 


H« 109731 


133213 


N63136 


AA903494 


rio>Dr ou 


124539 


N63172 


D54120 


He 14R40Q 


129196 


N63787 






124575 


N68168 


N68168 




124576 


N68201 


N68201 




124577 


N66300 

1^ www WW 


N68300 


Hs.136465 


124578 


N68321 


N68321 


Hs^1500 


124593 


N69575 


N69575 


Hs.102788 


128501 


N750D7 


AL1 33572 


Hs.199009 


332434 


N75542 


A1680737 


Hs.289068 


128473 


N90066 


T78277 


Hs.100293 


128639 


N91246 


AW582962 Hs.l02897 


124652 


N92751 


W19407 


Hs.3862 


133137 


N93214 


AB002316 


Hs.65746 


124671 




AK001357 


Hs.102951 


133054 


rxwf Or u 


AA464836 


Hs.291079 


425266 


R10RR5 
rv iwOUw 


JD0077 


Hs.155421 


124720 


R 11056 


R05283 




124722 


R11488 


T97733 


Hs.185685 


128944 


r\£w«/^w 


AL137586 


Hs.52763 


132965 


R96589 

1 \£UWUw 


AI248173 


Hs.191460 


426504 


R3758a 


AW162919 Hs.170160 


438828 


R37613 


AL134275 


Hs.6434 


124757 


R38398 


H11368 


Hs.141055 


124762 


R30179 


AA553722 


Hs.92096 


124773 


R4Q993 
r\*tww£w 


R45154 


Hs.338439 




R4117Q 
r\H 11(9 


R41179 


Hs.97393 


427961 


R41904 


AW293165 Hs.143134 




R49307 


NM-004427HS.165263 




R431AQ 


AW297929 Hs.328317 




R4^3nR 


W38537 


Hs.280740 






R44357 


Hs.48712 


124793 


R4451Q 


R44519 




124799 




R45088 




124812 


R47948 


R47948 


Hs.188732 


124821 


R51524 


H87832 


Hs.7388 


424123 


R 54050 


AW966158 Hs,58582 


124835 


R55241 


R55241 


Hs.101214 


124845 


R5Q5R5 
r\w>7»juw 


R59585 


Hs.101255. 


124847 


R60044 


W07701 


Hs.304177 


440630 


R60872 


BE561430 


Hs.239388 


124861 


R6R6Q0 
noDwsi/ 


R67567 


HS.10711D 


332503 




NM_004455Hs.l50956 


124879 


r\ 1 oooo 


R735B8 


Hs.101533 






A1970003 


Hs.23756 






H75964 


Hs.107815 




r\900^ 


R93622 


Hs.12163 


194040 




AF068846 


Hs.103804 


194041 


RQQR19 


AI766661 


Hs^7774 


124943 


1*02888 


AW963279 


Hs.123373 


124947 


T03170 


T03170 


Hs.100165 


124954 


T10465 


AW964237 


Hs.6728 


456882 


T15418 


U55184 


Hs.154145 


410653 


T15597 


BE383768 


Hs.65238 . 


418133 


T15652 


R43504 


Hs.6181 


440014 


T 16898 


AW960782 


Hs.6856 


131082 




AI091121 


Hs.246218 


194QR0 


T40ft41 


T40841 


Hs.98681 


194QR4 




BE313210 


Hs.334798 


124991 


T5011fi 


T50116 






1 wU IHw 


NM_004477Hs.203772 


195000 




158615 


Hs.235887 




T5Q040 

1 099HU 


AW118826 Hs.6093 


4444A4 


1 WU99^ 


AK002126 


Hs.11260 


195008 


T64891 


T91251 




195009 


T64924 


T64924 


Hs.303046 


445384 


T64933 


T79136 


Hs.127243 


125017 


T68875 


T68875 




125018 


169027 


T69027 


Hs.269481 


125020 


T69924 


T69981 




437871 


T70353 


AI084813 


Hs.114088 


134204 


779780 


AI873257 


Hs.7994 


125050 


T79951 


AW970209 Hs.111805 


125052 


T80174 


T65104 


Hs^779 


125054 


T80622 


T80622 


HS.268G01 



EST 
ESTs 

cell division cyde 42 (GTP-blnding prot 
ESTs, Weakly similar to 138022 hypotheti 
9b:za11c01.s1 Scares fetal liver spleen 
ESTs. Weakly simHar to 138022 hypotheti 
gb:za12g07.s1 Soaies fetal liver spleen 
EST 
ESTs 

protein contalnbig CXXC domain 2 
Homo sapiens cONA FU1 1916 fis; ctone HE 
0-!inked N-acetylglucosamine (GteNAc) tr 
CGI-47 protein 

regulator of nonsense transcripts 2; DKF 
KIAA0318 protem 

Homo sapiens cDNA FLJ10495 fis, done NT 
ESTs, Weakly similar to T27173 hypotheti 
alpha-fetoprotein 

gb:ye91c08.s1 Soares fetal liver spleen 
ESTs 

anaphase-promoting complex subunit7 

hypothetk:al protein MGC12936 

RAB2, member RAS oncogene family-like 

hypothetical protein DKF2p761F2014 

Homo sapiens done 23758 mRNA sequence 

ESTs, Moderately similar to A46010 X-Bn 

ESTs 

KIAA0328piotdn 
ESTs 

early development regulator 2 (homotog 0 
EST 

hypothetkal protein MGC3040 
hypothetteal protein FLJ20736 
gb:yg24h04^1 Soares in^t brain 1NIB H 
gb:yg38g04^1 Soares Infant brain 1NIB H 
ESTs 

keteh (Drosophila)>&ke 3 

Homo sapiens cDNA FLJ12769 fis. done NT 

EST 

ESTs 

Homo sapiens done FLB8503 PR02286 mRNA, 
Human DMA sequence from done RP1-304B14 
ESTs 

exostoses (multiple)-like 1 
ESTs 

hypothetical protein similar to swine ac 
ESTs 

eukaryottc translation initiation factor 
heterogeneous nudear ribonucteoprotein 
ESTs. Highly similar to AF161349 1 HSPCO 
ESTs. Weakly sfanilarto ALU1_HUMAN ALU S 
ESTs 

KIAA1548 protein 

hypothetical protein FU1 1585 

95 kDa retinoblastoma protem bind'mg pr 

ESTs 

ash2 (absent small, or homeolic Drosop 
Homo sapiens cDNA: FLJ21781 fis. done H 
ESTs 

eukaryotk; translatton ek}ngation factor 
gb7b77c10^1 Stratagene ovary (937217) 
FSHD region gene 1 
ESTs 

Homo sapiens cDNA: FU22783 fis. done K 
hypothetx»} protein FLJ11264 
gb:yd60a10^1 Soares fetal liver spleen 
ESTs 

Homo sapiens mRNA for KIAA1724 protein. 
gb7c30105.s1 Stratagene fiver (937224) 
sex comb on mkileg homolog 1 
gb7cl9d03.r1 Stratagene hing (937210) H 
ESTs 

hypothetkal protein FU20551 
ESTs 

ESTs, Moderately similar to similar to N 
ESTs. Weakly similar to envetope (H^ 
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125063 


T85352 


T85352 




125064 


T85373 


T85373 




125066 


T86284 


T86284 




416507 


T89579 


AL045364 


Hs.79353 


125080 


T90360 


T90350 


Hs^68620 


125097 


T94328 


AW576389 Hs.335774 


125104 


T95590 


T95590 




135107 


T97257 


T97257 


Hs.94560 


423122 T97599 


AA845462 


Hs.124024 


125118 T97620 


R10606 


Hs.269890 


125120 T97775 


T97775 


Hs.100717 


134160 T98152 


T98152 


Hs.79432 


125136 


W31479 


AW962364 H5.129051 


125144 W37999 


AB037742 


Hs.24336 


125150 


W38240 


W38240 




450142 


W40150 


AW207469 H5.24485 


131987 


W45435 


AW453069 Hs.3657 


125178 


W58202 


W93127 


Hs.31845 


125180 


W58344 


W58469 


Hs.103120 


125182 


W58650 


AA451755 


Hs.263560 


446888 


W68736 


AL030996 


Hs.16411 


125197 


W69106 


AF086270 


Hs.278554 


133497 


W69111 


BE617303 


Hs,74266 


429922 


W69399 


297630 


Hs.226117 


129232 


W69459 


R98881 


Hs.109655 


422166 


W72424 


W72424 


Hs.1 12405 


125209 


W72724 


W72724 


Hs.103174 


125212 


W72834 


AA746225 


Hs.103173 


456631 


W73955 


BE383436 


Hs.108847 


125223 


W74701 


AI916269 


Hs.109057 


125225 


W76540 


W74169 


Hs.16492 


125228 


W79397 


AA033982 


Hs.1 10059 


132393 


W85888 


AL1 35094 


Hs.47334 


125238 


W86038 


N99713 


Hs.109514 


125247 


W86881 


AA694191 


Hs.163914 


129296 


W87804 


AI051967 


Hs.1 10122 


125263 W88942 


AA098878 




125266 W90022 


W90022 


Hs.186809 


450862 W92272 


U91S43 


Hs.25601 


452401 


W92764 


NM.007115H529352 


428243 W93040 


H05317 


Hs.283549 


125277 


W93227 


W93227 


Hs.103245 


125278 


W93523 


AI218439 


Hs.129998 


125280 


W93659 


AI123705 


Hs.106932 


448205 


W94003 


W93949 


Hs.33245 


131844 


W94401 


AI419294 


Hs.324342 


125284 


W94688 


NM_002666Hs.103253 


417111 


W94787 


AW016321 Hs.82306 


445424 


Z38294 


AB028945 


Hs.12696 


125289 


238311 


T34530 


Hs.4210 


446313 


Z38465 


H06245 


Hs.106801 


431342 


238525 


AW971018 Hs.21659 


433227 


Z38538 


AB040923 


Hs.106808 


428306 


238551 


AB037715 


Hs.183639 


424624 


Z38783 


AB032947 


Hs.151301 


125295 


239113 


AB022317 


Hs.25887 


125298 


239255 


AW972542 Hs.289008 


125300 


239591 


239591 


Hs.101376 


448376 Z39783 


BE622770 


Hs.264915 


444562 


Z39920 


R55344 


H&22142 


130882 Z40166 


AA497044 


Hs.20867 


128888 Z40388 


A1760853 


Hs.241558 


125310 Z40646 


RS9161 


Hs.124953 


125315 Z41697 


R38110 


Hs.106296 


125317 Z99349 


Z99348 


Hs.112461 


135096 


Z99394 


AA081258 





gb:yd82d01^1 Scares fetal liver spleen 
gb7d82l07.s1 Scares fetalliver spleen 
gb:yd77b07.s1 Scares fetal liver spleen 
transcription factor Dp-1 
ESTs. Highly similar to ALU6_HUMAN ALU 8 
EST. IVIoderatety similar to S65657 alpha- 
gb:ye40a03.s1 Soanes fetal liver spleen 
ESTs. Moderately similar to 138022 hypot 
deltex (Drosophila) homobg 1 
gb:yf35f11 .si Scares fetal threr spleen 
EST 

fibraiin 2 (congenital oontractural era 
ESTs 

KIAA1321 protein 

Empirically selected from AFFX single pr 

chondroHin sulfate proteoglycan 6 (bama 

activity-dependent neuroprotective prote 

ESTs 

ESTs 

ESTs 

iiypcthetical protein LOC57187 

heterochromatin-jike protein 1 

hypothetical protein MGC4251 

HI histone family, member 0 

sex comb on midleg (DrosophiteHike 1 

S100 caicium-binding protein A9 (catgran 

ESTs. Weal<ly similar to TSP2^HUIWVN THROM 

ESTs 

hypothetical protein MGC2749 

ESTs. Weakly similar to ALU5.HUMAN ALU S 

OKFZP564G2022 protein 

ESTs. Weakly similar to 138022 hypotheti 

hypothetical protein FLJ14495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagene HeLa cell s3 93 
ESTs, Highly similar to LCT2.HUMAN LEUKO 
chromodomain hellcase DNA binding protel 
tumor necrosis factor, alpha-induced pro 
ESTs 
EST 

enhancer of polyoomb 1 

ESTs 

ESTs 

ESTs 

periQpin 

destrin (actin depolymerizing factor) 
cortactin SH3 domain-binding protein 
Homo sapiens cDNA FLJ13069 fis, done NT 
ESTs, Weakly similar to PC4259 feniGn 
ESTs 

kelch (Drosophiia)-like 1 
hypothetical protein FLJ10210 
Ca2-»dependent activator protein for seer 
sema domain, immunoglobulin domain (ig), 
Homo sapiens cDNA: FU21814 fis. done H 
EST 

Homo sapiens cONA FLJ12908 fis. done NT 

cytochrome b5 reductase b5R.2 

hypothetical pnstein FU10392 

artedne (Drosophila) homolog 2 

ESTs 

ESTs 

ESTs, Weakly similar to 138022 hypotheti 
zinc finger protein 36 (KOX 18) 
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Tabl^ 3A shows the accesston numbers for those pkeys lacking unigenelD's for Table 3. The pkeys in Table 7 lacking unigenelD's are repr^ented within 
Tawes For each probeset we have Usted the gene duster number from which the oiigonucleotides were designed. Gene dusters were compiled 
using sequence derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence similarity using Clustering and 
Ai^nment Tools (DoubleTwfet Oakland Cafifomia). The Genbank aocesskm numbers for sequences comprising each duster are Osted in the -Accesston" 
oonimn. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accesston: Genbank accession numbers 



Pkey 

108469 
124106 
108501 
108562 
101300 



CAT Number Accession 



116761J 
125446 1 
13684_-12 
36375_1 
4669_1 



132091 94851 1 



117034 
10D752 



20113_2 
33207.21 



116417 5418_11 



AA079487 AA128547 AA128291 AA079587 AA079600 

H12245AA094769R14576 

AA083256 

AA100798 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 
BE535511 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 
AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 
AW118940 AA121666 AI832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 A1735349 AW150109 
AI436154 AW118130 AW270782 A1804073 N27434 AA876543 AA937815 A1051 166 AA505378 A1041975 A1335355 
AI089540 AA662243 AI127912 Ai925604 AI250880 AI366874 AI564386 AI815196 AI683526 A1435885 AI160934 H79030 
AI801493 AA448691 Ar673767 Ai076042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA911493 AI589411 AI371358 AW576236 A1078B66 AW516168 AA346372 A1560185 AA471009 R75857 
AA296025 AA523155 AA853188 AI696593 A1658482 A1566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

AW954243 AA829930 AA412478 AA828434 AA814538 AI927418 AI192435 W52897 AA443666 AA031913 AI683306 
AA918481 A1183314 D83907 AI206832 AA876122 D83836 D83838 D82533 AI761290 AI191125 AI143749 AW771909 
A(241436 AI767267 W56507 AA847787 AA568692 T10502 AI247870 AA715017 AA643304 AA890233 AA811387 
AA897470 AA907729 AI708679 A1078010 AA452830 AW419160 AI783713 N80205 W56778 AA676899 AI888718 N69930 
AI338935 AI21 7580 AA639508 AA575836 BE046852 AI312651 AI038406 AA628649 AA643838 AM93761 AA032024 
W38849 AA3401 78 AA447052 AA452969 W19369 AA296364 H44229 W58767 C05751 C05835 AI741989 N98532 
AW102617 AA412583 A1922246 W38495 AA355375 AA928571 C06275 AA352500 N93132 
U72209 NM_005748 AI655607 A1052758 AA385199 AW956794 H88679 AL135153 AI765644 AA384399 AW966458 
AA568443 AA804610 AI873513 H88639 Z25371 R63456 W44919 

T81309 BE019033 R94181 BE019198 NM_000612 J03242 AW411299 BE300064 BE297544 R94182 AW630108 T53723 
D58853 H78073 H80594 BE299560 T48899 H70196 M17426 N77077 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R881 16 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X03562 M141 18 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA653480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91 192 H70332 X07868 X07868 H69464 H53782 H73710 R80435 AA553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA41 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428 AI039521 H92969 N59389 H80538 H72933 T90630 AA41ia91 N55000 H7422S AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 1^4623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084 R95723AA634316AA862781 H77310 R91389 H93111 R92767 T54512 R89341 H70333 H57817 H82941 HK032 
N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 AI200S49 
AA342640 BE298855 BE250229 T49916 HB2008 N2a278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW889144 N76677 H71702 H58036 H71915 R91612 R87807 H68059 AI133328 AI247866 AA621443 AW881050 
AA700847 AA340413 AW878608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AW884986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 'AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW88341 7 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139AW882B27 AW883641 AW883567 AW883481 AW882983AW8a2982AW882465AW883419AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882B35 
AW882830 AW883563 AW882456 AW627642 «vv(»«ja 
AW499664 AW500888 AUW2095 AW576556 AW265424 AI521500 AA761333 AA761319 AW291137 AA649040 AA769094 
AA489664 AA635311 AW070509AA425658 AI381489 AA609309 AA134476 W74704 AJ923640AW084888 H45700 
AI985564 AWK9495 AW614573 AI859571 AI693486 AA913892 A1806164 AA909524 AW263513 AI356361 240708 
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123712 
117156 



125008 
125020 
125086 
116661 
125104 
124575 
125263 
131859 



374423.1 
145392.1 



1802095.1 

116017J 

1814993.1 

1532859.1 

413347.1 

1666649.1 

1547.2 

3672.1 



125565 1704098.1 
132983 1192^.1 



118584 532052_1 
133607 1227 6 



AI332765 AI392620 AA181060 AW118719 AW96B804 AW263502 AW505314 AA036967 W74741 R51139 H19364 H45751 
Z44962 AW370823 H25650 T54007 AA453000 AL045739 

AA609684AA758732 

W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 A1754442 AI338392 AI253102 AI079403 AI370541 AI697341 

H97538 AW188021 AI927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 AI891028 AI913512 

AI144448 W73819 AA604358 N28900 Wg5221 AI868132 H98465 AA148793 

T91251 T64891 T85665 

T69981T69924AA078476 

T86284 T81933 

R61504 F04247 

T95590AA703278H62764 

N68168 N69188 N90450 

AA098878W88942 

AW960564 AA092457 T55890 D56120 T92525 A1815987 BE182808 BE182595 AW080238 M90657 AA347236 AW961686 
AW176446 AA304671 AW583735 T81714 AA316968 A1446615 AA343532 AA0d3489 AA4880D5 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308636 AA112474 W76162 AA088544 H52265 
AA301631 H80982 AA113786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA45251 5 AA001806 AA19D873 AA180483 AA159546 F00242 AI940609 
AI9406D2 AI189753 T97663 T661 10 AW062898 AW062910 AW062902 AI051622 AI828930 AA102452 AI685095 A1619390 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW629109 AW513200 AA921353 A1677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 
AI052270 AW3881 17 AW3881 1 1 AA699452 At242230 hM7476 H38178 AA366621 AA1 1 3196 AA130023 H39740 T61629 
AI885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 AI591386 BE392486 BE385852 AA682601 A1682884 AA345840 T85477 AA292949 
AA932079 AA098791 D82607 T48574 AW752038 C06300 
R20840 R20839 

M30269 NI^002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 AI818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21232 AA190565 AW379755 AW067895 

AW136928 At685655 6E218584 BE465078 1^68963 AA975338 BE147199 N76377 

BE273749 BE397561 BE387189 AL037858 AL037878 A1963094 BE259216 AA01 1363 AL036189 BE562325 AA251169 
BE617431 N98537 AA158093 AL047800 M34539 NM_000801 AA312140 D16971 AA158904 AA307114 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 C18055 BE241757 AA1 15056 
AI936769 6E378435 BE208971 AW674924 6E622060 AA604674 AA115273 AW402159 AA338608 BE568819 M80199 
X55741 AA375111 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 AI201645 AI201642 AI192622 N40104 AA370921 BE547569 AI969602 AA302038 A1197890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538828 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AW98844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 AI301395 
AA303362 N32719 AA358328 AA357877 AI952540 H56279 H02758 H0204B AW805233 R82224 AA410772 AA291352 
BE171109 N69935 BE 1 69248 AA361 173 H44978 BE617887 D52560 AA084043 W03595 R67219 N36477 hW2924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA316096 BE622896 W46872 AA442678 BE544893 BE540112 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03205 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 AI339329 AA304424 AA605236 AA936934 
AA587673 AI209162 AI697301 AI479995 AI679814 AI361950 AW189125 AI955888 AI986019 BE301019 A1084792 
AI310211 AW189307 A1022070 AW977204 A1146825 AW190163 AW303281 A1828345 BE046043 AW029257 AA482268 
AI246507 AM20729 AW084932 AW439514 AI890487 AW439692 AI523896 AI186612 AI659953 A1889773 AA687527 
AW072694 AW262153 AW467371 AI613269 AI679238 D54404 AA158103 AW105527 AW149739 AW150361 AW268387 
AW1 17708 AI951682 AI687440 AW674285 AA678365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA612935 AI814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI263283 AW613175 AI890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AI292052 AW381707 AI147854 AW381743 AA158905 AA303258 AAB88144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W58652 T94995 AI8e9743 AI679145 
AW085971 N98425 AA765136 A1347027 AI356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 At798447 AA668646 AA251008 A1984538 AI401737 AA056186 BE043308 AW662375 AI3021 10 
N50724 W96332 BE537047 N26983 AI567172 AA765298 AW673237 N29784 AA534275 AA084044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
AI302086 AI767772 N25755 H53114 AA706133 T93511 AA429291 AA935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA235312 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079 AA961143 AW057776 AW973745 AA016311 AW071227 AA017511 AI753994 
W47374 T64155 AA296092 AI698526 AA558158 AA2960B8 AW794259 H01963 AA149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 A!005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 A1282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H19484 R00890 A1627677 AA302287 AI869451 AI734855 AI708073 
A1832902 AA585184 AW204299 AA055565 D12417 D11975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W98331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 A1022644 R02585 X52220 AW296894 AA825671 
AI699321 AI393601 AW592611 AI146747 AA608921 AA158365 AW590007 AA354519 D20081 R027O4 AW798339 
M92422 AA094903 AA007676 
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134403 170371 



135096 
103767 



126872 
113026 



120284 
112540 
111904 
121094 
128510 
114106 
121335 
120761 
122050 
130018 
100104 
121822 



108280 
108309 
107832 
123523 
123533 
132225 
125017 
125063 
125084 



33756.2 
34817_1 



103855 84277 1 



AI352558 Z82248 X78138 NM^003405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AI120082 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA322911 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151G67 BE002724 AA608688 H82692 BE155392 AW383836 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 AI922133 A1697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 A1B29449 AA583503 A1635674 AW131665 AI473820 AW2731 18 AW90093O AA908944 AI688035 AW170272 
AKW2545 AW468176 AI608761 AI0a2748 AI91 1682 AI248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AW70204 AI188296 AI421367 AI125315 AI087141 AA629032 AA740589 AI5541 81 
AA150830 AI248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H2S337 AI089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W48459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520AW028427AA171496 A/469689 AWe64539AI811102Ai811116 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 FHW963 R69858 H67097 AI917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H11617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 

AA334551 BE008229 AA307537 AW961 156 AW995894 AW995826 NM_006751 M61 199 AA045603 AL036372 AV645606 
A1688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 AW994986 AW994989 BE090429 R23614 AI567932 H03726 
H01 101 H01867 AA548743 A1671806 AW872949 AW872941 AA742447 AI199788 AA045604 AI637465 AI741796 
AW242217 AW131463 AI765302 A1683923 AA889762 AI804889 AI986437 C06049 BE502340 AI695651 AI491970 
AA496804 AA28ia08 AA665699 AI473814 BE301445 AA707837 AA551925 A1017348 A1208185 AA775203 AA156296 
AA557463 H95441 AA768547 AW769358 AA991 197 AA181954 AI091389 AI147289 AW771837 AI638582 AA84441 1 
AI374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01 102 R67868 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281174 AA428908 
AA081258 AA16031 1 W17034 H83596 299393 AI831206 AW771 108 AW769214 N89775 AW161495 AW161522 
AW160880 299394 A1814820 

BE244667 BE241813 BE242271 AA381943 NM.016040AF151858 AW967497 AW966873 AI824386 AW470133 
AW015765 BE018650 AW503659 AI129838 AI632346 AA013099 AW77051 1 BE219482 AI824135 AI867379 AA019348 
AA285143 AW087624 AI990100 AA251084 A1633962 AA287714 AA400773 A1292112 AW469095 AA743312 AW1 17423 
AA694551 AA885657 AA112675 BE327333 AA082161 H03613 AA094735 AW500235 N28878 AA287713 AW300233 
AA826249 N46921 BE348728 AW505056 AW966879 AI521202 AA393405 AI264668 AA910851 AA251721 AM70834 
H03503 AAQ89688 R58562 BE004728 AA668793 H27167 R54717 

W02363 N80298 AA304486 AW954799 AW805136 AW970817 AW373398 AW875459 AA136805 AA683501 N73299 
AW341082 AI632954 AA493369 AI478433 AI037911 AW272169 AW043832 AA010683 AW629090 AW183622 N64510 
AW079953 A1554533 AA563670 AA010682 AW237610 AW419057 AI470926 AI627833 AA195080 AA195179 AI471443 
AW590266 AI168477 AW771214 A1767341 AW340086 AW748455 A1280079 A1244821 AI381283 AW300130 AW183374 
AW1 95397 AA136706 AI824598 AW573004 Z98448 AA905255 AI497883 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE011359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 

AA376654 W76367 AA318232 AI694545 AI742403 AI887383 AW204731 AW874431 BE220997 AA1 14979 AA303838 
Aia02267 AW952031 W74801 AA011287 AA1151 12 AI306385 R37677 AW571707 R59986 W94102 AW197042 H10206 
AW139819 AI686172 AI674165 R51633 AI367086 T23948 H10833 H23002 H11743 R37085Z39208 H22794 H11820 
R13817 Z43122 H10257 R88398 R18795 AA010848 R67191 H10875 R67170 
AA179656 AA182626 AA182603 
R69751 R70467 H69771 H80879 H80878 
Z41572R39330 
AA402505 AA398900 

X94703 NM.004249 R52316 T87420 N46403 Z36855 BE076834 
AW602528 BE073859 Z38412 
AA404418Ai217248 
AA321890R18000 

AI453076 AI376075 AI014836 AA628633 AA961066 A1150282 AI028574 AI217182 AA732910 AA431478 AL041229 
AA353093 AW957317 AW872498 A1560785 AI2891 10 AW135512 X97261 T68873 
AF008937 

AI743860 N49543 AW027759 BE349467 A1656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 A1611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI61 161 1 AI61 1407 AI611452 W56437 AI284566 
AI583349 AW183058 AI308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AM32414 A1984994 AI539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 A1580502 AI832944 AI038530 AA425107 AK)14986 AI148349 AW23J721 AW779756 
AW137877 AI125293 AA400404 R28554 
AA065069AA085108 

AA069818 AA069971 AA089923 AA069908 
genbanK_AA021473 AA021473 
genbank.AA608588 AA608588 
genbanleAA608751 AA608751 
genbank_AA1 28980 AA128980 
genbank_T68875 T68875 
genbank^T85352 T85352 
genbanK_T8S373 T85373 



142696J 
84431.1 



158963_1 

1605263_1 

1719336.1 

275729_1 

19829 1 

1182096.1 

279548.1 

224903.1 

273507_2 

18985.1 

19974.^ 

244391.1 



110682.1 
111495.1 
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125091 genbanK_T91518 T91518 

1009B4 entre?_J00212 J00212 

102269 entre?^U30245 U30245 

125150 NOT FOUND entre^W38240 W38240 

123964 genbanK.C13961 C13961 

118111 9enbanK.N55493 N55493 

1 18129 genbanleN57493 N57493 

102491 entnB?^U51010 U51010 

1 18329 genbanleN63520 N63520 

1 18475 genbanK.N&6845 N86845 

118581 genbanK_N68905 N68905 

1 1 1514 genban)eR07998 R07998 

104534 R22303 at R22303 

120340 genbanKJ^206828 AA206828 

120376 genbanlLAA227469 AA227469 

104787 genbanlLAA027317 AA027317 

120409 genbaak^AA235050 AA235050 

120745 genbank^AA302809 AA302809 

120809 genbankj^346495 AA346495 

120839 genban)eAA348913 AA348913 

1 13702 genbanleT97307 T97307 

106596 304084_1 AI583948 AA578212 AW303715 AA653450 AA456981 AI400385 W88533 AI224133 AW272145 AA088686 R9469B 

1 13947 genbanK.W84768W84768 
122562 genbank AA452156 AA452156 
122635 genbanlCAA454085 AA454085 
108277 genbanK«AA064859 AA064859 
108403 genbanleAA075374 AA075374 
122860 genbanK.AA464414 AA464414 
108427 genbanleAA076382 AA076382 
108439 genbanK_AA078986 AA078986 

131353 231290_1 AW411259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 At027260 R52686 AA918278 AI129462 

AAg89360 N34869 AI948416 AA534205 AA7Q2483 AA705292 
108533 genban)eAA084415 AA084415 
124254 genbank_H69899 H69899 
101447 entre2^M21305 M21305 
101458 entre^M22092 M22092 

101667 13349 1 NM_005381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA187806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW373711 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW3Q2637 F07929 H17433 
AW382712 H05665 F07292 N39a75 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF114098 AW375993 A1133569 W52561 AA603040 AA133710 A1928796 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA860830 A1859203 AI445772 AA714334 A1817066 
AI832027 AW510442 AI635802 AW088306 AW06e672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AI051 154 AW41Q203 AI921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW080082 AW383778 Ai933053 AI919097 W31557 N90245 AA931591 AA563995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA569119 AA508573 AI809050 AI378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 A1270128 AI472365 AA411363 AI523216 
A1719965 AI816302 AA182681 AI707990 AA133588 AI758537 W60253 A1460308 AA135423 A1083904 F04188 N89693 
AW408776 AI678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 020218 A1273980 
AA159D28 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA962015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 A1127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373683 AI940524 AW374953 T56500 N24329 A1940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 A1305271 AA433818 AA612607 AI831809 AI940409 
AA158663AI572988 

124576 genbanK.N68201 N68201 

108931 genbank AA147186 AA147186 

108941 genbanK.AA148650 AA148650 

124720 144582^1 R05283R11056 

124793 genbanK.R44519 R44519 

124799 genbanleR45088 R45088 

103138 entre^X85965 X65965 

117683 genbanK.N40180 N40180 

124991 genbanK.T50116 T50116 

103432 entrez X97748 X97748 

119174 genbanK.R71234 R71234 

119239 95573.2 T11483T11472 

133678 • 11235_1 AW247252 AA346143 NM 000270 AA381085N91995X00737AA381 079 AA296473AA2961 10 AA315735AA31 1617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W86334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AI024906 At096488 AI885005 AA194872 N90489 A1452544 H72411 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI981789 R65918 N30611 
AI979189 AI280889 AW273191 R66531 AI285845 A1675927 A(421990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 AA548276 A1149466 AA772669 AA694368 
AA724168 AA3016S1 AA261^ AA779925 AA234760 W88290 AA913603 AW511745 At500697 AA814922 AA835040 
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T47964 H53998 M975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970748 AA972548 
AI3771 16 R62962 H16737 R89070 AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 N63861 AI674743 AA457307 AA21 1475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 A1735713 H82987 C02447 A1478666 T27651 
A1S99770 AW025156 H69719 AI984717 N69225 A1459856 AA953577 AI424691 H13843 R22404 AI873796 AI336002 
N70898 AI420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781608 AA329050 AA339985 R68964 H64795 W04186 H16845 
genbank_T97186 T97186 

W38194 
W38197 



119416 

119558 NOT.FOUND_entre2L.W38194 

10 1 19559 NOT J^OUND_entfe?_W38197 

119654 genbanK.W57759W57769 

121350 genbanK.AA405237 

121558 genbanK_AA412497 

105985 genbanK.AA406610 

15 114648 genbanKjAA101056 

121895 genbanleAA427396 

100327 entrez^D55640 D55640 

123315 714071J AA496369 AA496646 

123473 genbank_AA599143 AA599143 



AA405237 
AA412497 
AA406610 
AA101056 
AA427396 
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TABLE 4: 

Pkey: Unique Eos probeset identifier number 

Accession: Acces^n number used for previous patent fi&igs 
ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 
Unlgene Title: Undone gene title 



Pkey Accession ExAccn UniGene UnigeneTitle 



100405 D86425 
100420 086983 
100481 HG1098-HT1098 
100484 HG1103-HT1103 
100718 HG3342-HT3519 
100991 J03764 
101097 L06797 
101168 L15388 
101194 L20971 
101261 L35545 
101345 L76380 
101447 M21305 
101485 tV124736 
101543 M31166 
101550 M31561 
101560 M32334 
101674 M61916 
101714 1^68874 
101741 1^4719 
101838 M92934 
101657 1^94856 
102012 U03057 
102024 U03877 
102164 U18300 
102241 U27109 
102283 U313M 
102303 U33053 
102564 U59423 
102663 U70322 
102759 U81607 
102778 U83463 
102804 U89942 
102887 X04729 
102898 X06256 
102915 X07820 

103036 X54925 

103037 X54936 
103095 X60957 
103158 X67235 
103166 X67951 
103185 X69910 
103280 X79981 
103554 Z18951 
103850 AA187101 
104465 N24990 
104592 R81003 
104764 AA025351 
104786 AA027168 
104850 AA040465 
104865 AA045135 
104894 AA054087 
104952 AA071089 
104974 AA085918 
105178 AA187490 
105263 AA227926 
105330 AA234743 
105376 AA236559 
105729 AA292694 
105826 AA398243 
105977 AA406363 
106008 AA411465 
106031 AA412284 
108124 AA423987 



Y00630 Hs.75716 
AW958272 Hs^7326 
NM_002291Hs.82124 



AW291587 Hs.82733 nldogen2 

D86983 Hs.1 18893 Melanoma associated gene 

X70377 Hs.121489 cystatinD 

NIVL005402HS.288757 v-ral simian leukema viral oncogene horn 
BE295928 Hs.75424 inhibitor of DNA binding 1 . dominant neg 
J03836 Hs.82085 serine (or cysteine) proteinase inhibito 
BE245301 Hs.89414 chemoklne(C-X<: motiO. receptor 4 (fus 
NM_005308Hs.211569 G protelnKsupted receptor Idnase 5 
1-20971 Hs.188 piiospliodiesterase 4B, cAI^-spedfic (dun 
D30857 Hs.82353 protein C receptor, endothelal (EPCR) 
NM.005795HS.152175 calcitonin receptor-iil« 
M21 305 gbiHuman alpha satellite and satellite 3 

AA296520 Hs.89546 selectin E (endothelial adhesion molecul 
M31 166 Hs.2050 pentaxin-related gene, rapidly Induced b 
serine (or cysteine) proteinase inhibito 
intercellular adhesion moiecute 2 
laminln. beta 1 

M68874 Hs.211587 phosphoKpase A2. group tVA (cytosolic. 
NI^003199Hs.326198 transcription factor 4 
BE243845 Hs.75511 connective tissue growth fector 
BE550723 Hs.1 53179 fatty add binding protein 5 (psoriasis- 
BE259035 Hs.1 18400 singed (Drosophila)-Iike (sea urchin fas 
AA301867 Hs76224 EGF-containing fibulin-like extracellula 
NM.000107HS.77602 damage-specific DNA binding protein 2 (4 
NM»007351Hs.268107 muIOmerin 
AW161552 h1s.83381 guanine nucleotide binding protein 11 
U33053 Hs.2499 protein kinase C-like 1 
U59423 Hs.79067 MAD (mothers against decapentaplegic, Dr 
NM.002270HS.168075 karyopherin Omportin) beta 2 
NM_005100Hs.788 A kinase (PRKA) andior protein (gravin) 
AF000652 Hs.8180 syndecan binding protein (syntenin) 
NM-00231 8HS.83354 lysyl oxidase-like 2 
J03836 Hs.82085 serine (or cysteine) proteinase inhibito 
NIVL002205HS.149609 integrin, alpha 5 (fibronectin receptor. 
X07820 Hs.2268 matrix metaltoproteinase 10 (stromelysin 
M13509 Hs.83169 matrix metalloproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunogkibulin and 
hematopoleticaily expressed homeobox 
AA159248 Hs.180909 peroxiredoxin 1 
N(4.006825Hs.74368 transmembrane protein (63kD). endoplasmi 
U84722 Hs.76206 cadherin 5. type 2. VE-cadherin (vascula 
A1878826 Hs.74034 caveoHn l.caveolae protein. 22kD 
AA187101 Hs.213194 hypothettoal protein M6C1 0895 
Z44203 Hs.26418 ESTs 
AW630488 Hs.25338 protease, serine. 23 
AI039243 Hs.278585 ESTs 

Hs.10031 KIAA0955 protein 
Hs.8728 hypothetical protein DKFZp434G171 
Hs.22575 B-cell CLL/lymphoma 6, member B (zinc fi 
Hs.18858 phospholipase A2. group IVC (cytosiMic. 
AW076098 Hs.345588 desmoplakin (DPI. DPIl) 
Y12059 Hs^78675 bromodomain-containing 4 
AA313825 Hs.21941 AD036 protein 
AW388633 Hs.6682 solute carrier family 7. (cationic amino 
AW338625 Hs,22120 ESTs 
AW994032 Hs.8768 hypothetical protein FU10849 
H46612 Hs.293815 Homo sapiens HSPC285 mRi^A. partial cds 
AA478756 Hs.194477 E3 ublquitin Ogase SMURF2 
AKD01972 Hs.30822 hypothetical protein FU1 11 10 
AB033888 Hs.8619 SRY (sex detemilnlng regton Y)-bax 18 
X64116 Hs.171844 Homo sapiens cDNA: FU22296 ffi, ctane H 
H93366 Hs.7567 Homo sapiens cDMA:FLJ21962fis, done H 



BE018302 Hs.2894 
NM.005424HS.78824 
BE242587 Hs.1 18651 



AA027167 
AL133035 
T79340 
AF065214 
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106155 AA425309 AA425414 Hs.33287 

106302 AA435896 AA398859 Hs.18397 

106423 AA448238 AB020722 Hs.16714 

106793 AA478778 H94997 Hs.16450 

5 107174 AA621714 BE122762 Hs.25338 

107216 D51069 D51069 Hs.211579 

107295 T34527 AA186629 Hs.80120 

107385 U97519 NM-005397Hs.16426 

108756 AA127221 AA127221 Hs.117037 

10 108846 AA132983 AL1 17452 Hs.44155 

108888 AA135606 AA135606 Hs.189384 

109001 AA156125 AI056548 Hs.72116 

109166 AA179845 AA219691 Hs73625 

109456 AA232645 AW956580 Hs.42699 

15 109768 F10399 F06838 Hs.14763 

110107 H16772 AW151660 Hs.31444 

110906 N39584 AA035211 Hs.17404 

110984 N52006 AW613287 Hs.80120 

111006 N53375 BE387014 Hs.166146 

20 111018 N54067 AI287912 Hs.3628 

111133 N64436 AW580939 Hs.97199 

111760 R26892 BE551929 Hs.268754 

113073 T33637 N39342 Hs.103042 

113195 T57112 H83265 Hs.8881 

25 113923 W80763 AW953484 Hs.3849 

114521 AA046808 AW139036 Hs.108957 

115061 AA253217 AI751438 Hs.41271 

115096 AA255991 AI683069 Hs.175319 

115145 AA258138 AA740907 Hs.88297 

30 115819 AA426573 AA486620 Hs.41135 

115947 AA443793 R47479 Hs.94761 

116314 AA490588 AI799104 Hs.178705 

116339 AA4g6257 AK000290 Hs.44033 

116430 AA609717 AK001531 Hs.66048 

35 116589 D59570 AI557212 Hs.17132 

116733 F13787 AL157424 Hs.61289 

117023 H88157 AW070211 Hs.102415 

117186 H98988 H98988 Hs.42612 

117563 N34287 AF055634 Hs.44553 

40 117997 N52090 N52090 Hs.47420 

118475 N66845 N66845 

118581 N68905 N68905 

119073 R32894 BE245360 Hs.279477 

119155 R61715 R61715 Hs.310598 

45 119174 R71234 R71234 

119221 R98105 C14322 Hs^50700 

119416 T97186 T97186 

119866 W80814 AA496205 Hs.193700 

121335 AA404418 AA404418 

50 121381 AA405747 AW088642 Hs.97984 

123160 AA488687 AA488687 Hs.284235 

123473 AA599143 AA599143 

123523 AA608588 AA608588 

123533 AA608751 AA608751 

55 123964 C13961 C13961 

124006 D60302 AI147155 Hs^0016 

124315 H94892 NIVL005402Hs^88757 

124659 N93521 AI680737 Hs^9068 

124669 N95477 AI571594 Hs.102943 

60 124847 R60044 W07701 Hs.304177 

124875 R70506 A1887664 Hs.285814 

125091 T91518 T91518 

125103 T95333 AA570056 Hs.122730 

125355 R45630 R60547 Hs.170098 

65 125565 R20839 R20840 

125590 R23858 R23858 Hs.143375 

423765 R23858 R23858 Hs.143375 

126511 AI024874 T92143 Hs.57958 

100286 W26247 BE247550 Hs.86859 

70 126563 W26247 AA516391 Hs.l81368 

126649 AA856990 AA001860 Hs^79531 

449602 AA856990 AA001860 Hs^79531 

126872 AA136653 AW450979 

456000 AA136653 ^180876 Hs.11614 

75 414221 AA136653 AW450979 

127402 AA356869 AA358869 Hs.227949 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N'acetylnatpha-D^alactosamine:polyp 

podocalyxin-like 

ESTs 

DKFZP586G1517 protein 

gb:zl10a05^1 Soares_ptegnanLuteiusJsibH 

hypothetical protein FU20992 similar to 

RAB6 interacting, kinesin-like (rabklnes 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyi-alpha-D-galactosamine:polyp 
Homer, neuronal immediate early gene, 3 
mitogen-activated protein idnase kinase 
complement component Clq receptor 
Homo sapiens cDNA FU1 1949 fis, ctone HE 
microtubule-associated protein IB 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FLJ22041 simiiarto 
40S ribosomal protein S27 isofonm 
Homo sapiens mRNA full length Insert cDN 
ESTs 
ESTs 

endomucln-2 
KIAA1691 protein 

Homo sapiens cDNA FU1 1333 fis, ctone PL 

dipeptidyt peptidase 8 

hypothetical protein FU10669 

ESTs. Moderately similar to 154374 gene 

synaptojanln 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
unc5 (Cetegans homotog) c 
EST 

gb:za46c1 1 .si Soares fetal liver spleen 
gb:za69b09.s1 Soares_fetaUung_NbHL19W 
ESTs 

ESTs, Moderately simiiarto ALU1.HUMAN A 
gb:yi54c08.s1 Soaies placenta Nb2HP Homo 
tjyptase beta 1 

gb7e50h09.sl Scares fetal fiver spleen 
Homo sapiens mRNA; cDNA DKFZp5B6l0324 (f 
gb:zw37e02.s1 SoarBS_totaLfetus_Nb2HF8_ 
hypothetical protein FLJ22252 simiiarto 
ESTs. Weakly simiiarto 138022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinonoa 
gb:ae54e06.s1 Stratagene lung carcinoma 
gba^6h07.s1 Stratagene hmg cardnonrta 
gb:C13961 Ctontech human aorta polyA-i- mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FLJ11918 fis. done HE 

hypothetical protein MGC12916 

Homo sapiens done FL68503 PR02286 mRNA, 

sprouty (Drosophila) homokig 4 

gb:ye20t05.s1 Stratagene hmg (937210) H 

ESTs, Moderately simiiarto KIAA1215 pro 

KIAA0372 gene product 

gb7g05cG8.rl Soares infant brain INIB H 

Homo sapiens, done IMAGE:3840937, mRNA, 

Homo sapiens, done IMAGE:3840937, mRNA, 

EGF-TM7-latrophifin-reIated protein 

growth factor receptor-bound protein 7 

U5 snRNP-spedfic protein (220 kO), orth 

ESTs 

ESTs 

gb:UI-H-B13-aIa-a-12-0AJI.s1 NCI CGAP.Su 
HSPC065 protein 

gb:UI-H-BI3^la^-12-04JLs1 NCLCGAPjSu 
SEC13 (S. cerevisiae)^ 1 
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127651 AI123976 
424806 AI123976 
128062 AA379500 
128992 R49693 
5 129046 AA195678 
129188 M30257 
129314 AA028131 
129371 M10321 
129466 J03040 
10 129765 M86933 
129805 AA012933 
129884 AA288710 
130495 AA243278 
130639 D59711 
15 130657 T94452 

130828 AA053400 
130972 AA370302 
131080 J05008 
131137 U85193 
20 131182 AA256153 
131486 X83107 
131573 AA046593 
131647 AA410480 
131756 D45304 
25 131859 M90657 
131881 AA010163 
132050 AA135353 
132083 Y07867 
132164 U84573 
30 132358 X60486 

132413 AA132969 
132456 AA1 14250 
132490 F13782 
132676 AA283035 
35 132687 AB002301 
132718 AA056731 
132736 U68019 
132760 H99198 
132933 AA598702 
40 132968 N77151 

132994 AA505133 
133061 AB000584 
133147 D12763 
133161 AA253193 
45 133200 AA432248 
133260 AA083572 
133363 AA479713 
133491 L40395 
133517 X52947 
50 133550 W80846 
133607 M34539 
133614 D67029 
133627 U09587 
133691 M85289 
55 133696 D10522 
133913 W84712 
133975 D29992 
133985 L34657 
134039 S78569 
60 134088 D43636 
134161 U97188 
134299 AA4875S8 
134416 M28882 
116470 X70683 
65 134656 X14787 

134989 AA236324 
135051 C15324 
135073 AA452000 
135349 D83174 
70 100114 D00596 
100130 D11428 
100143 D13640 
100168 D14874 
100208 D26129 
75 100224 D28476 
100405 D86425 



AA379621 
H04150 



NM_001955Hs.2271 
W27392 Hs.33287 
AI824144 
F06972 
AA040311 



Hs.23912 
Hs.27372 
Hs.28959 



AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 



AA382523 Hs.105689 MSTP031 protein 
AA382523 Hs.105689 MSTP031 protein 

Hs.105547 neural proliferatk)n,diffeientiation an 
Hs,107708 ESTs 
AB029290 Hs.108258 acUn binding protein; macrophin (microf 
NI\A.001078Hs.109225 vascular cell adhesion molecule 1 
BE622768 Hs.290356 mesoderm development candidate 1 
X06828 Hs.1 10802 von WOlebrand factor 
AW410538 Hs.111779 secreted protein, addfc. cystelhe-rich 
M86933 Hs.1 238 amelogenin (Y chromosome) 
AA012848 Hs.12570 tubulin-specific chaperoned 
AF055581 Ks.13131 lysosomal 
AW250380 Hs.109059 mitochondrial fibosomal protein L12 
AI557212 Hs.17132 ESTs. Moderately simitar to 154374 gene 
AW337575 Hs.201591 ESTs 
AW631469 Hs.203213 ESTs 

D81866 Hs.21739 Homo sapiens mRNA; cDNA DKFZp586l1518 (f 
endothelin 1 
nuclear factor l/B 
ESTs 

BMX non-receptor tyrosine kinase 
ESTs 
ESTs 
ESTs 

transmembrane 4 superfamily member 1 
upstream regulator element binding prot 
ESTs 

BE386490 Hs.279663 Pirin 

AI752235 Hs.41270 procollagen-lysine, 2-oxoglutarate 5-dio 
NWL003542HS.46423 H4 histone family, member G 
AW361383 Hs.260116 metalioprotease 1 (pitrilysin family) 
AB011084 Hs.48924 KIAA0512 gene product; ALEX2 
NM_001290Hs.4980 LIM domain binding 2 
N92589 Hs.261 038 ESTs. Weakly similar to 136022 hypotheti 
AB002301 Hs.54985 KIAA0303 protein 
NM.004600Hs.554 Sjogren syndrome antigen A2 (60kD. ribon 
AW081883 H5.211578 Homo sapiens cDNA: FU23037 fis, done L 
AA125985 Hs.56145 thymosin, beta, identified in neuroblast 
BE263252 Hs.6101 hypothetkal protein MGC3178 
AF234S32 Hs.61638 myosin X 
AA112746 Hs.279905 ctone HCM)310 PRO0310p1 
AI186431 Hs.296638 prostete differentiation factor 
AA026533 Hs.66 interleukin 1 receptor-like 1 
AW021 103 Hs.6631 hypothetical protein FU20373 
AB037715 Hs.183639 hypothetical protein FU1 0210 
AA403045 Hs.6906 Homo sapiens cDNA: FU231 97 fis. done R 
A1866286 Hs.71962 ESTs, Weakly similar to B36298 prolinen- 
BE619053 Hs.1 70001 eukaryotic translation initiation factor 
NM.000165HS.74471 gap junction protein, alpha 1 , 43kD (con 
vesicle-assodated membrane protein 5 (m 
FK506-binding protein 1A (12kD) 
SEC14 (S. cerev)siae)-nke 1 
glycyl-tRNA synthetase 
Hs.211573 heparan sulfate proteoglycan 2 (peitecan 
Hs.75607 myristoylated alanine-rlch protein kinas 



AI129903 Hs.74669 
BE273749 

NIVL003003HS.75232 
NWL002047HS.75280 
M85289 
AI878921 

AU076964 Hs.7753 calumenin 
C18356 Hs.295944 tissue fador pathway inhibitor 2 
L34657 Hs78146 platelet/endothelial cell adhesion molec 
laminln. alpha 4 
KIAA0096 protein 
IGF-(I mRNA-binding protein 3 
oomptement component Clq receptor 
Hs.21 1579 melanoma cell adhesion molecule 
Hs.83484 SRY (sex detemnining regton Y)-box 4 
thrombospondin 1 

nudbc (nudeoside diphosphate linked mol 
SRY (sex determining region Y)4)0X 4 
Homo sapiens mRNA; cONA DKFZp566E1624 (f 
serine (or cysteine) proteinase inhibito 
thymidylate synthetase 
NM_000304Hs.103724 peripheral myelin protein 22 ^ 
AU076465 Hs^8441 KIAA0015 gene produd 
H73444 Hs.394 adrenomeduliin 
NI\/L002933Hs.78224 ribonudease. RNase A famSy. 1 (pancrea 
AL121516 Hs.138617 thyroid hormone receptor interactor 12 
AW291S87 Hs.82733 nidogen2 



NM.002290HS.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW560939 Hs.97199 
X68264 
AI272141 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W5S956 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



100420 
100455 
100529 
100818 
100619 
100658 
100676 
100718 
100752 
100828 
100850 
100991 
101097 
101110 
101142 
101156 
101168 
101184 
101192 
101317 
101336 
101345 
101400 
101475 
101485 
101498 
101505 
101543 
101557 
101560 
101587 
101592 
101633 
101634 
101667 
101682 
101714 
101720 
101741 
101744 
101793 
101837 
101838 
101840 
101857 
101864 
101931 
101966 
102012 
102013 
102024 
102059 
102121 
102283 
102300 
102378 
102395 
102460 
102491 
102499 
102523 
102560 
102564 
102589 
102600 
102645 
102687 
102693 
102709 
102759 
102804 
102882 
102907 
102915 
102927 
102960 



086983 
D87953 

HG1862+iT1897 

HG2614-HT2710 

HG2639-HT2735 

HG2855-HT2995 

HG3044-HT3742 

HG3342-HT3519 

HG3543-HT3739 

HG4069-HT4339 

HG417-HT417 

J037&4 

L06797 

L08246 

L12711 

L13977 

L15388 

L19871 

L20859 

L42176 

L49169 

L76380 

M15990 

M23254 

M24736 

M26576 

M27396 

M31166 

M31994 

M32334 

M35878 

M36429 

M57730 

M57731 

M60858 

M62994 

M68874 

M69043 

M74719 

M75126 

MS4349 

M92843 

M92934 

M93056 

M94856 

M95787 

S76965 

S81914 

U03057 

U03100 

U03877 

U08021 

U14391 

U31384 

U32944 

040369 

U41767 

U48959 

U51010 

U51478 

U53445 



U59423 
U62015 
U63625 
U67963 
U73379 
U73824 
U77604 
U81607 
U89942 
X04412 
X06985 
X07820 
X12876 
X15729 



D86983 Hs.1 18893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.114599 
N24433 Hs.241567 
U56725 Hs.180414 
X02761 Hs.287820 
BE295928 Hs.75424 
T81309 

AL048753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AI439011 Hs.86386 
L12711 Hs.8g643 
AA340987 Hs.75693 
NM.005308Hs^11569 
NM_001674Hs.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NM_006732Hs.75678 
NM_005795Hs.1 52175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NM_005381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM.003199Hs^26198 
AI879352 Hs.118625 
W01076 Hs^78573 
M92843 Hs.343586 
BE243845 Hs.75511 
AA236291 Hs.1 83583 
BE550723 Hs.1 53179 
BE392588 Hs.75777 
NM_006823Hs.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224 
AI752666 Hs.76669 
NM_004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AI984144 Hs,66713 
AL1 19566 Hs.6721 
N1VL007019HS.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NM_005100Hs.788 
NM.002318HS.83354 
AI767736 Hs.2gD070 
BE409861 Hs.202833 
X07820 Hs.2258 
BE512730 Hs.65114 
A1904736 Hs.76(^ 



Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorytase kinase, delt 

collagen, type VIII, alpha 1 

RNA binding motif, single stranded inter 

heat shock 70kD protein 2 

fibronectin 1 

inhibibrof DNA binding 1, dominant neg 
insulin-like growth factor 2 (somatomedl 
small Inducible cytokine A2 (monocyte ch 
cathepsin B 

serine (or cysteine) proteinase inhibito 

chemoklne (C-X-C motiO. receptor 4 (fus 

myeloid cell leukemia sequence 1 (BGL2-r 

transkelolase (Wernicke-Korsakoff syndro 

prolylcarboxypeptidase (angtotensinase C 

G proteinK»upled receptor kinase 5 

activating transcriptkm factor 3 

solute canier family 20 (phosphate tran 

four and a half LIM domains 2 

FBJ murine osteosarooma viral oncogene h 

calcitonin receptor-ike 

v-yes-1 Yamaguchi sarcoma viral oncogene 

calpain 2, (m/ll) large subunit 

selectin E (endothelial adhesion moleaJl 

collagen, type IV. alpha 1 

asparagine synthetase 

pentaxin>related gene, rapidly Induced b 

aldehyde dehydrogenase 1 family, member 

intercellular adhesion molecule 2 

insulin-like growth factor binding prote 

guanine nucleotide binding protein (G pr 

ephrin-A1 

GR02 oncogene 

nucleolin 

filamln B, beta (actin-binding protein-2 
phospholipase A2, group IVA (cytosolic, 
nuclear factor of kappa ^ht polypeptki 
transcription fiador 4 
hexokinase 1 

CD59 antigen p18-20 (antigen klentffied 
zinc finger protein homokigous to Zfp^B 
connective tissue growth factor 
serine (or cysteine] proteinase inhibito 
fatty acid binding protein 5 (psoriasis- 
transgelin 

protein kinase (cAIVlP^iependent catalyti 
immediate early response 3 
singed (DrosophilaHike (sea urchin fas 
catenin (cadherin-assodated protein), a 
EGF-containing fiburin-iike extracellula 
nicotinamide N-methyltransferase 
myosin IE 

guanine nucleotide binding protein 11 
dynein. cytoplasmic, light polypeptide 
spermidine/spermine Nl-acetyltransferase 
a disintegrin and metalloproteinase doma 
myosin, light polypeptide kinase 
gb:Human nkx}tinamide N-methyltransferas 
ATPase, Na-tflC* transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13. H^adherin (heart) 
MAD (mothers against decapentaplegte, Dr 
cysteine-rich. angngenic inducer. 61 
hepatitis delta antigen-interacting prot 
lysosomal 

ubiquitin canier protein E2-C 
eukaiyoSc translation initiation factor 
microsomal glutathnne 8-transferase 2 
A kinase (PRKA) andior protein (gravin) 
lysyl oxidase-Oke 2 
gelsolin (amytokfosis, Finnish type) 
heme oxygenase (decyding) 1 
matrix metaltoproteihase 10 (stromelysin 
keratin 18 

DEADIH <Asp<3lu^a^p/His) box potypep 
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103011 X52541 
103020 X53416 
103029 X54489 
103036 X54925 
5 103056 X57206 
103080 X59798 
103095 X60957 
103138 X65985 
103176 X69111 

10 103195 X70940 
103347 X87838 
103371 X91247 
103432 X97748 
103471 Y0O815 

15 103967 AA303711 
104447 L44538 
104764 AA025351 
104783 AA027050 
104798 AA029462 

20 104865 AA045136 
104877 AA047437 
104894 AA054087 
104952 AA071089 
105113 AA155450 

25 105178 AA187490 
105196 AA195031 
105215 AA205724 
105263 AA227926 
105271 AA227986 

30 105330 AA234743 
105461 AA253216 

105492 AA256210 

105493 AA256268 
105594 AA279397 

35 105727 AA292379 
105732 AA292717 
105767 AA346551 
105882 AA400292 
105936 AA404338 

40 106031 AA412284 
106124 AA423987 
106222 AA428594 
106241 AA430108 
106263 AA431462 

45 106264 AA431470 
106356 AA443756 
106454 AA449479 
106634 AA459916 
106724 AA465226 

50 106793 AA478776 
106799 AA479037 
106842 AA482597 
106868 AA487561 
106890 AA489245 

55 106961 AA504110 
106974 AA520989 
107030 AA599434 
107061 AA608649 
107086 AA609519 

60 107216 D51069 
107385 U97519 
107444 W28391 
107985 AA035638 
108507 AA083514 

65 108695 AA121315 
108931 AA147186 
109001 AA156125 
109195 AA188932 
109390 AA219653 

70 109456 AA232645 
109737 F10078 
110411 H46032 
110660 H82117 
110906 N39584 

75 111018 N54067 
111091 N59658 



AJ243425 H5.328035 earty growth response 1 
X53416 Hs.195464 faamin A, alpha (ac6n-binding piotein- 



AW800726 Hs.789 
M13509 Hs.83169 
Y18024 H5.78877 
AU077231 H5.82932 
NM.005424HS.78824 
X65965 

ALJ021154 Hs.76884 
AA351647 Hs.2642 



GR01 oncogene (melanoma growth stimulati 
matrix metalloproteinase 1 (interstitial 
irK>sitol 1,4A^phosphate 3-Miase B 
cycQn D1 (PRAD1: paiathyroM adenomatos 
tyrosine Icinase with immunoglobulin and 
gb:H.sap!ens SOD-2 gene for manganese su 
inhibitor of DN A binding 3. dominant neg 
eukaryotlc translation elongation factor 
AU077309 Hs.171271 calenin (cadherin-assoctated protein), b 
X91247 Hs.13046 thloredoxin reductase 1 
X97748 gb:H.sapl8ns PTX3 gene promotor region. 

Y0081 5 HsJ5216 protein tyrosine phosphatase, receptor t 
AL120051 Hs.144700 ephrin-BI 
AW204145 Hs.156044 ESTs 
AI039243 Hs.278585 ESTs 

AA533513 Hs.93659 protein disulftde Isomerase related prot 
AW952619 Hs.17235 Homo sapiens done TCCCIA00176 mRNA sequ 
T79340 Hs.22575 B-ceU CLUIymphoma 6, member B (zinc fi 
AI138635 Hs.22968 Homo sapiens done IMAGE:451939. mRNA se 
AF065214 Hs.18858 phospholipase A2. group IVC (cytosolic 
AW076098 Hs.345588 desmoplaldn (DPI, DPII) 



AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
66539071 Hs.69388 



Homo sapiens, done IMAGE:3506202. mRNA, 
AD036 protein 
angiotensin receptor-Ilka 1 
hypothetical protein FU14957 
solute carrier family 7. (cationic an^ 
ESTs 
ESTs 

hypothetical protein FU20505 
AI805717 Hs.289112 CGM3protefri 
AL047586 Hs.10283 RNA binding motif protein 8B 
AB024334 Hs25001 tyrosine 3>monooxygenase/tryptophan 5-mo 
AL135159 Hs.20340 KIAA1 002 protein 
AW504170 Hs.274344 hypothetical protein MGC1 2942 
AW370946 Hs.23457 ESTs 

W46802 Hs.81988 disabled (Drosophila) homolog 2 (mitogen 
A1678765 Hs.21812 ESTs 

Hs.171844 Homo sapiens cDNA: FU22296 fis. done H 



X64116 
H93366 
AA356392 
BE019681 
W21493 
AL046859 
AA186715 



Hs.7567 Homo sapiens cDNA: FU21962 fts, done H 
Hs.21321 Homo sapiens done FLB9213 PR02474 mRNA, 
Hs.6019 Homo sapiens cDIMA: FU21288 fis, done C 
Hs.28329 hypothetical protein FU14005 
Hs.3407 protein kinase (cAMP-dependent catalytl 
Hs.336429 RIKEN cDNA 9130422N19 gene 



NM_014038Hs.5216 HSPC028 protein 



W25491 
N48670 
H94997 
6E313412 
AF124251 
BE185S36 
AA489245 

AW243614 Hs.18083 
AI817130 Hs.9195 
AL1 17424 Hs.25035 
BE147611 Hs.6354 
NfVL012331Hs^6458 



Hs.288909 hypothetical protein FU22471 
Hs.28631 Homo saptens CDNA: FU22141 fis. done H 
Hs.16450 ESTs 

Hs.7961 Homo sapiens done 25012 mRNA sequence 
Hs^054 novel SH2-oontelnlng protein 3 
Hs.301 183 motecute possessing anfcyrin repeats indu 
H5.88500 mitogen-adivated protein kinase 8 inter 

Homo sapiens cDI^ FU10768 fis, done NT 
Homo sapiens cDNA FU13698 fis. done PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductese A 
D51069 Hs.211579 melanoma ceQ adheston molecute 
NM_005397Hs.16426 podocalyxin-like 

proliferation-associated 2G4, 38kD 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
ESTs 

KIAA1077 protein 

gb:zo38d01.s1 Strategene endothelial eel 
hypothetical protein FU20992 similar to 
solute carrier family 4. sodium bicaibon 
EH-domain conteinlng 3 
ESTs 

ESTs. Moderately similar to A47582 B-cel 
Homo safriens mRNA for KIAA1741 protein. 
ESTs 
ESTs 

mitogen-activated protein kinase Mnase 
hypothetical protein DKFZp434N185 



Hs.343258 
Hs.71968 
Hs.68301 
Hs.70823 

HS72116 
Hs.132904 



W28391 
T40064 
A1554545 
AB029000 
AA147186 
AI056548 
AF047033 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300087 Hs.33032 
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111356 
111378 
111741 
111769 
112318 
112951 
113057 
113195 
113490 
113542 
113803 
113847 
113910 
113947 
114047 
115061 
115819 
115870 
115964 
116228 
116264 
116314 
116589 
117023 
117112 
117156 
117176 
117280 
119559 
119866 
120655 
121314 
121335 
121822 
121835 
122331 
122577 
123160 
123486 
124059 
124339 
124358 
124364 
124726 
124763 
125167 
125304 
125307 
125329 
107985 
125598 
125609 
116024 
418000 
126399 
127435 
127566 
127619 
434190 
128453 
128495 
128515 
128580 
128623 
128642 
128669 
128903 
128914 
129087 
129188 
129226 
129265 
129345 
129468 
129488 
101838 



N90933 

Ng3764 

R26124 

R27957 

R55470 

T16550 

T26674 

T57112 

T88700 

T90527 

W42789 

W60002 

W78175 

W84768 

W94427 

AA253217 

AA426573 

AA432374 

AA446622 

AA478771 

AA482594 

AA490588 

D59570 

H88157 

H94648 

H97538 

H98670 

N22107 

W38ig7 

W80814 

AA287347 

M402799 

AA404418 

AA425107 

AA425435 

AA442872 

AA452860 

AA488687 

AA599674 

F13673 

H99093 

N22495 

N23031 • 

R15740 

R39610 

W45560 

Z39833 

Z40583 

AA825437 

R66613 

R66613 

AA868063 

AA128075 

AA1 28075 

AA128075 

N66570 

AI051390 

AA627122 

AA627122 

X02761 

AF010193 

AA149044 

U82108 

D78676 

L35240 

AA598737 

R69417 

AA232837 

N72695 

M30257 

M96d43 

X68277 

AA292440 

J03040 

AA228107 

AA449788 



BE301871 Hs,4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 
AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW880709 Hs.283683 
NM_005032Hs.4114 
AA1 13262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NM-005985HS.48029 
AA987568 Hs.74313 
AI767947 Hs.50841 
D51174 Hs.272239 
AI799104 Hs.178705 
AI557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs.238205 
W07343 HS.18253B 
AA404418 
AI743860 

AB033030 Hs.300670 
AL133437 Hs.110771 
AA829725 Hs.334437 
AA488687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs.283713 
H99093 Hs.343411 
AW070211 Hs.102415 
AF265555 Hs.250646 
NM_003854Hs.104576 
BE4 10405 Hs76288 
AL137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs330466 
AA825437 Hs^875 
T40064 HS71968 
T40064 HS71968 
AA868063 H5.104576 
AA088767 Hs.83883 
AA932794 Hs.83147 
AA088767 Hs.83883 
X69086 Hs^86161 
AI051390 Hs.116731 
AA627122 Hs.163787 
AA627122 Hs.163787 
X02761 Hs.287820 
NM_005904Hs.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs.345728 
AW867491 Hs.107125 
A1348027 Hs.108557 
NM.001078HS.109225 
BE222494 H5.180919 
AA530892 Hs.171695 
R22497 Hs.110571 
AW410S36 Hs.111779 
AW96672B Hs^642 
BE24384S Hs7SS11 



mannosyl (a!pha-1,3-)-gIycoprotein beta- 
hypothetical gene DKFZp434A1114 
KiAA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypotheti 
vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from done RP1-187J11 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens cDNA FU10500 fis, clone NT 
Homo sapiens mRNA forfaAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T isofonn) 

Homo sapiens, done IMAGE:3937015. mRNA. 
gb:2h53d03.s1 SoaresJetaUiver„spleen_ 
FXYD domain-containing ion transport leg 
Homo sapiens mRNA full length insert cDN 
endomudn-2 

snail 1 (drosophifa homofbg). zincfinge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FLJ1 1333 fis, done PL 
ESTs, Moderately similar to 154374 gene 
Homo sapiens mRNA; cDNA DKFZ^6N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FLJ21409 fis. clone C 
Empirically seleded from AFFX single pr 
Homo sapiens mRNA; cDNA DKF^86I0324 (f 
hypothetical protein PRQ2013 
phospholipid scramblase 4 
gb:zw37e02.s1 Soares.totaLfetus.Nb2HF8_ 
metaliothionein IE (functional) 
KIAA1204 protein 

Homo sapiens cONA: FU21904 fis. clone H 
hypothetical protein MGC4248 
ESTs, Weakly similar to 138022 hypotheti 
Homo sapiens cDNA FLJ14680 fis, done NT 
ESTs. Weaicly simBar to S64054 hypotheti 
DEAD/H (Asp-GIu-Ala-Asp/His) box poiypep 
Homo sapiens mRNA; cDNA DKF2f^86N0121 (f 
baculovlrai lAP repeakXDntaining 6 
cart)ohydratB (keratan sulfate Gal-6) sul 
calpain 2, (nrtAl) la^e subunK 
netrin4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
cartx)hydrate (keratan sulfate 6al-6) sul 
transmembrane, prostate androgen Induced 
guanine nucleotide binding protein-tike 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FU13613 fte, done PL 
ESTs 
ESTs 
ESTs 

fibroneclin 1 

MPJD (mothers against decapentaplegic. Dr 
type I transmembrane protein Fn14 
solute carrier family 9 (sodium/hydrogen 
CTL2 gene 

enigma (UM domain protein) 
heat shock 70kD protein 8 
STAT induced STAT inhibitor 3 
plasmalemma veside associated protein 
hypothetical proteh PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2, dominant neg 
dual spedfk% phosphatase 1 
growth anest and ONA-damageHnducble, 
secreted protein, addic. cysteine-rich 
meflikmlne adenosyltransfeiase II, beta 
connective tissue growth factor 



155 



NSDCXJID: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



413731 AA449789 
129557 W01367 
129619 AA610116 
129627 AA258308 
129762 AA460273 
129884 AA286710 
130018 T68873 
130147 D63476 
130178 M62403 
130282 X55740 
130431 L10284 
130495 AA243276 
130553 AA430032 

130638 H16402 

130639 D59711 
130657 T94452 
130688 AA431571 
130776 R79356 
130818 AA280375 
130840 Z49269 
130899 Z41740 
131002 AA121543 
131080 J05008 
131084 AA101878 
131091 T35341 
131107 N87590 
131182 AA256153 
131207 W74533 
131319 U25997 
131328 V01512 
131509 X56681 
131555 AA161292 
131564 AA491465 
131573 AA046593 
131692 D50914 
131756 D4S304 
131859 Mg0657 
131909 W69127 
131915 AA316186 
132046 AA384503 
132050 AA136353 
132151 M044755 
132164 U84573 
132187 M058911 
132303 AA620962 
132314 AA285290 
132358 X60486 
132398 R31641 
132421 AA489190 
132490 F13782 
132520 AA257993 
132546 M24283 
132610 AA443114 
132716 T35289 
132640 N23817 
132883 AA047151 
132968 N77151 
132989 AA480074 
132999 Y00787 
133071 T99789 
133076 W84341 
133099 L09209 
133147 D12763 
133149 T16484 
133161 AA253193 
133200 AA432248 
133220 X82200 
133260 AA083572 
133295 L00352 
133349 N75791 
133391 X57579 
133398 X02612 
133436 H44631 
133454 AA090257 
133478 X83703 
133491 L40395 



BE620886 Hs.75354 
AI824144 Hs.23912 
AF104266 Hs.24212 
NM.003155HS.25590 
AW939251 Hs.25647 



Hs.2780 



Hs.28959 
Hs.30736 
Hs.31595 



6E243845 Hs.75511 connective tissue growth foctor 

AL045404 Hs.46366 KIAA0948 protein 

AA209534 Hs.284243 letraspan NET-6 protein 

T40064 Hs.71968 Homo sapiens mRNA; cDNA DKF2p564F053 (fr 

AA453694 Hs.12372 tripartite motif protein TRI1M2 

AF0555B1 Hs.13131 lysosomal 

AA353093 metaHothbnein 1L 

D63476 Hs.172813 PAK-interacting exchange factor beta 

U20982 Hs.1516 insuUn^ke growth factor-binding prote 

BE245360 Hs.153952 S" nudeofidase (C073) 

AW505214 HS.155S80 catnexin 

AW250380 Hs.109059 mitochondriaf ribosomal protein L12 

AF062649 Hs^52587 pituitary tumor-transfom^ 1 

AW021276 Hs.17121 ESTs 

AI557212 Hs.17132 ESTs, Moderately simnar to 154374 gene 
AW337575 Hs.201591 ESTs 

BE548267 Hs.337986 Homo sapiens cDNA FLJ1 0934 fis. done OV 
AF167706 Hs.19280 cysteine-rich motor neuron 1 
AW190920 Hs.19928 h/pothetica) protein SP329 
BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 
AI077288 Hs.296323 serum/glucocorticoid regulated kinase 
AL050295 Hs.22039 KIAA0758 protein 
NI^001955Hs.2271 endolhelin 1 
NNL017413HS.303084 apelln; peptide ligand for APJ receptor 
AJ271216 Hs.22880 dipeptidylpeptidase III 

6CN1 (general control of amino-add synt 
ESTs 
latrophilin 
stanniocaldn 1 

v-fos FBJ murine osteosarooma viral onco 
jun D proto^ncogene 
Hs.278613 interferon, alpha-indudble protein 27 
Hs.28792 Homo sapiens cONA FU1 1041 fis. done PL 
ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 supeifamOy member 1 
NIC016558HS.274411 SCAN domabH»ntalning 1 
AI161383 Hs.34549 ESTs. Highly slmflar to S945411 done 4 
Hs.179260 chromosome 14 open reading frame 4 
Hs.38022 ESTs 

Hs.173705 Homo sapiens cDNA: FU22050 fis. done H 
Hs.41270 procollagen-lysine. 2-oxogIutarate5-dk> 
Hs.4193 DKFZP58601624 protein 
BE177330 Hs,325093 Homo sapiens cDNA:FU21210 fis, done C 
AF1 12222 Hs.323806 pinin. desmosome associated protein 
NtyL003542Hs.46423 H4 histone family, member G 

ESTs. Weakly similar to A43932 mucin 2 p 
double ring-finger protein, Dorfin 
Lll^ domain binding 2 
Janus kinase 1 (a protein tyrosine kinas 
Hs.168383 intercellular adhesion molecule 1 (CD54) 
Hs.5326 amino add system N transporter 2: porou 
Hs.283738 casein kinase 1. alpha 1 
Hs.5807 GTPaseRab14 

Hs.5897 Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
Hs.61638 myosin X 
AA460a74 Hs.331328 hypothetical protein FU13213 
Y00787 Hs.624 interteukinB 

BE384932 Hs.64313 ESTs. Weakly similar to AF2571821 G-pro 
AW946276 Hs.6441 Homo sapiens mRNA; cDNA DKFZp586J021 (fr 
W16518 Hs^9518 amylokl beta (A4) precursor-like protein 
AA026533 Hs.66 interteukin 1 receptor-like 1 
AA370045 Hs.6607 AXIN1 up-regulat^ 
AW021 103 Hs.6631 hypothetical protein FU20373 
AB037715 Hs.183639 hypothetical protein FU10210 
NIVL006074HS.318501 Homo sapiens mRNA fuU length Insert cDN 
AA403045 Hs.6906 Homo sapiens cDNA: FU23197 fs. done R 
AI147861 Hs^13289 low density lipoprotein receptor (famifi 
AW631255 Hs.8110 L-S-hydroxyacyl-Coenzyme A dehydrogenase 
inhibtn. beta A (adivin A. activin AB a 
cytochrome P450, subfamily I (aromatic c 
immediate earty protein 
hypothetk:al proteb I^GC5618 
cardiac ankyrin repeat protein 



X56681 

T47364 

T93500 

AA040311 

BE559681 

AA443966 

AW960564 



AI359214 
AI267615 
BE379499 
AI752235 
AA235709 



AA876616 Hs.16979 
AW163483 Hs.48320 
NIVL001290HS.4980 
AA257992 Hs.50651 
M24283 
AA160511 
BE379595 
BE218319 
AA373314 
AF234532 



AW103364 HsJ27 
NM_000499Hs.72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 



BE619053 Hs.170001 eukaryotic translafon initiatton factor 
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133510 

133517 

133526 

133538 

133562 

133584 

133590 

133617 

133651 

133671 

133678 

133681 

133722 

133730 

133760 

133802 

133825 

133838 

133859 

133889 

133960 

133975 

133977 

134039 

134075 

134081 

134164 

134203 

134238 

134299 

134332 

134339 

134343 

134381 

134403 

134416 

134493 

134558 

134817 

134983 

134989 

135052 

135062 

135069 

135071 

135073 

135170 

135196 

135348 



AA227913 

X52947 

Ml 1313 

L14837 

M60721 

D90209 

T67986 

AA148316 

U97105 

T25747 

K02574 

D78577 

X53331 

S73591 

X95735 

L16862 

U44975 

M97796 

U86782 

AA099391 

M19267 

D29992 

L19314 

S78569 

U28811 

L77886 

C14407 

M60278 

R81509 

AA487558 

D86962 

AA478971 

D50683 

U56637 

M61199 

M28882 

X15183 

S53911 

U20734 

D28235 

AA236324 

AA148923 

AA174183 

AA456311 

L08069 

AA452000 

AA282140 

J02854 

AA442054 



AW880841 Hs.96908 
NM^000165Hs.74471 
AU077051 HS74561 
NM_003257Hs.74614 
M60721 Hs.74870 
D90209 HS.1&1243 
T70956 Hs.75106 
BE244334 Hs.75249 
AI301740 Ks.173381 
AW503116 Hs.301819 
AW247252 
Ai352558 

AW969976 Hs.279009 
BE242779 Hs.179526 
BE410769 HsJ5873 
AW239400 HS76297 
BE616902 Hs.285313 
BE222494 Hs. 180919 
U86782 Hs.178761 
U48959 Hs211582 
M19267 Hs.77899 
C18356 Hs.295944 
AI125639 Hs.250666 
NM.002290HS78672 
NM_012201Hs78979 
AL034349 Hs79005 
AW245540 Hs.79516 
AA161219 Hs.799 
AA102179 Hs.160726 
AW580939 Hs.97199 
D86962 H5.81875 
R70429 Hs.61988 
D50683 Hs.82028 
AI557280 Hs.184270 
AA334551 

X68264 Hs^11579 
M30627 Hs.289088 
NM^001773Hs.85289 
AU076592 Hs.198951 
D28235 Hs.196384 
AW968058 Hs.92381 
AL136653 Hs,93675 
AK000967 Hs.93872 
AA876372 Hs.93961 
W27190 Hs.94 
W55956 Hs.94030 
T53169 Hs.9587 
C03577 Hs.9615 
U80983 Hs^68177 



p53-induced protein 

gap junction protein, alpha 1, 43kD (con 

a!ptia-2-macroglobulin 

tight junction protein 1 (zona ocdudens 

H2.0 (OrosophiiaHike homeo box 1 

activating transcription factor 4 (tax-r 

dusterin (complement lysis inhibitor. S 

ADP-ribosylaiion ^Mike 6 interact! 

dih/dropynmtdrnase4il(e 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monooxygenase/tiyptophan 5-mo 

matrix Gta protein 

upregulated by 1,25-dihydroxyvttamin D-3 
zyxin 

G protein-coupled receptor kinase 6 

core promoter element binding protein 

inhibitor of DNA bindmg 2, dominant neg 

26S proteasome-assodated padi homolog 

myosin, light polypeptide kinase 

tropomyosin 1 (alpha) 

tissue factor pathway inhibitor 2 

hairy (DFosophlla)-homotog 

laminin, alpha 4 

Golgi apparatus pmtein 1 

protein tyrosine phosphatase, receptor t 

brain abundant, membrane attached signal 

diphtheria toxin receptor (heparin-bindi 

Homo sapiens cDNA FU1 1680 fis. done HE 

complement component Clq receptor 

growth factor receptor-bound protein 10 

dteai}Ied (Drosophaa) homdog 2 (mitogen 

tiansfbnning growth fector, beta reoepto 

capping protein (ac&n filament) musde 

spemn specific antigen 2 

melanoma cell adhesion molecule 

heat shock 90kD protein 1, alpha 

CD34 antigen 

jun B proto-oncogene 

prostaglandln-endoperoxide synthase 2 (p 

nudix (nudeoside diphosphate linked moi 

dedduat protein induced by progesterone 

KIAA1682 protein 

Homo sapiens mRNA; cDNA DKFZp667D095 (fr 
DnaJ (Hsp40) homolog. subfamily A, membe 
Homo sapiens mRNA; cDNA DKF2p586E1624 (f 
Homo sapiens cDNA: FU22290 fis, done H 
myosin regulatory light chain 2, smooth 
phosphoiipase C, ganuna 1 (formerly subty 
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TABLE 4A 

Table 4A shows the accession numbeis for those pkeys lacking unigenelD's for Table 4. The pkeys in Table 7 lacking unigenetD*s are represented wHhin 
Tables 1-6A. For each probeset we have listed the gene duster number from whteh the ofigo^ Gene dusters were compiled 

5 using sequences derived from Genbank ESTs and iriRNAs. These sequences were clustered based on sequence simQaiity using Clustering and 

Alignment Tools (DoubleTwist. Oakland California). The Genbank accession numbers for sequences oompiising each duster are listed in Una 'Aocesston' 
column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accesston: Genbank accession numbers 



IS Pkey 



CAT Number Accession 



100752 33207.21 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



117156 145392J 



131859 3672.1 



125565 1704098J 
133607 1227.6 



T81309 BE019033 R941B1 BE019198 NM_G00612 J03242 AW411299 BE300064 BE297544 R94182 AW630108 T53723 
D58853 H78073 H80594 BE299560 T48899 H70196 M17426 t(77Q77 S77035 H58384 H61664 H78540 T84527 017196 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 X06260 • 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R88116 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425T87663 R10627 X03562 M14118 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R11555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 IM78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207632 N91 192 H70332 X07868 X07868 H69464 H53782 H73710 R80435 AA553384 AW864176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430216 AA553476 AI916470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53836 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA411690 
H94199 AA663208 6E205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428Ai039521 H92969 N59389HB0538 H72933 T90630AA411891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H66714 H68758 N68396 H48763 NS9256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R8g258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084 R95723 AA634316 AA862781 H77310 R91389 H931 1 1 R92767 T54512 R89341 H70333 H57817 H82941 H62032 
N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N5041 1 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 AI200549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW889144 N76677 H71702 H68036 H71915 R91612 R87807 H68059A1 133328 AI247866AA621 443 AW881 050 
AA700847 AA340413 AW878608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AWB84986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878859 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW883567 AW883481 AW882983 AW882982 AW882465 AW883419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 

W73853 AA9281 12 W77887 AW889237 AA1 48524 AI749182 AI754442 AI338392 AI253102 A1079403 AI370541 AI697341 
H97538 AW188021 AI927669 W72716 AI051402 A1188071 AI335900 N21488 AW770478 W92522 AI691028 Ai913512 
AI144448 W73819 AA604358 N28900 W95221 AI868132 H9d465 AA148793 

AWg60564 AA092457 T55890 D56120 T92525 AJ815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 
AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308636 AA1 12474 W76162 AA088544 H52265 
AA301631 H80982 AA113786 BE620997 AW6516gi AA343799 BE613669 BE547160 BE546656 F1 1933 AA376800 
AW239185 AA376086 BE544387 BEei9041 AA452515 AA001806 AA190873 AA180483 AA159546 F00242 AI940609 
AI940802 AI189753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 AI828930 AA102452 AI685095 AI819390 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW629109 AW5132D0 AA921353 AI677934 AI148698 AI955858 AA173B25 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 A1474275 AW205937 
AI052270 AW3881 17 AVV3881 1 1 AA699452 AI242230 N47476 H38178 AA366621 AA1 13196 AA130023 H39740 T61629 
A1885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 A1591386 6E392486 BE38S852 AA682801 At682884 AA345840 T85477 AA292949 
AA932079 AA098791 D82607 T48574 AW752038 006300 
R20840 R20839 

BE273749 BE397561 BE387189 AL037858 AL037878 AI963094 BE259216 AA011363 AL036189 BE562325 AA251169 
BE617431 N98537 AA158093 AL047800 M34539 NWL000801 AA312140 D16971 AA158904 AA307114 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 018055 BE241757 AA1 15056 
AI936769 BE378435 BE206971 AW674924 BE622060 AA604674 AA1 15273 AW4021 59 AA338608 BE568819 M80199 
X55741 AA3751 1 1 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
6E543239 AA082517 AI201645 Ai201G42 AI192622 N40104 AA370921 6E547569 A1969602 AA302038 Atig7890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538826 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 6E537893 N32299 AW855829 AI291320 BE078322 AI301395 
AA303362 N32719 AA3S8328 AA357877 A1952540 H56279 H027S8 K02048 AW805233 R82224 AA410772 AA2913S2 
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133681 13893_1 



134403 17037J 



126872 142696.1 

121335 279548J 

130018 18986.1 

121822 244391.1 



123523 



BE171 109 N69935 BE169248 AA361 173 H44978 BE617887 D52560 AA084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA31 6096 BE622896 W46872 AA442678 BE544893 BE5401 12 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03206 BE396692 AA482370 AA031614 AA301576 
AA304710 AA132153 AA029796 AA994960 H19567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
M643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 A1339329 AA304424 AA605236 AA936934 
AA587673 AI209162 A1697301 A1479995 AI679814 AI361950 AW189125 AI955888 AI986019 BE301019 At084792 
AI310211 AW189307 AI022070 AW977204 AI146825 AW190163 AW303281 AI828345 BE046043 AW029257 AA482268 
AI246507 A1420729 AW084932 AW439514 AI890487 AW439892 AI523896 AI186612 AI659953 Ai889773 AA687527 
AW072694 AW262153 AW467371 A1613269 AI67g238 D54404 AA158103 AW105527 AW14g739 AW1S0361 AW268387 
AW117708 A1951682 AI687440 AW674285 AA678365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA612g35 AI814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AAQ29797 AI244560 
At128734 AW381720 AI092360 AI263263 AW613175 At890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AI292052 AW381707 AI147854 AW381743 AA158905 AA303258 AA888144 AW19S967 
AA428705 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W58652 794995 AI869743 AI679145 
AW085971 N98425 AA765136 AI347027 A1356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 AI798447 AA668646 AA251008 AI984538 AI401737 AA056186 BE043308 AWe62375 AI302110 
N50724 W96332 BE537047 N26983 AI567172 AA765296 AW673237 N29784 AA534275 AA084044 AW087973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW089878 N67751 H05982 N23140 A1362647 
AI302086 AI767772 N25755 H531 14 AA706133 T9351 1 AA429291 AA935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA235312 W52205 N93444 R82180 
H02759 H79696AW088894 H56079 AA961143 AW067776AW973745 AA016311 AW071227 AA017511 AI753994 
W47374 T64155 AA296092 A1698626 AA558158 AA296088 AW794259 H01963 AA149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 AI005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 AI282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056821 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H19484 R00890 AI627677 AA302287 AI869451 AI734855 AI708073 
AI832902 AA585184 AW204299 AA055565 D12417 D11975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 AI022644 R02585 X52220 AW296894 AA8^1 
AI699321 AI393e01 AW592611 AI146747 AA608921 AA158365 AW590007 AA354519 D20081 R02704 AW798339 
M92422 AA094g03 AA007676 

AI352558 Z82248 X78138 NM.003405 AU077248 AA223125 S80794 D78577 A1124697 AW4a3970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192S40 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA30S009 H69431 H69456 AL120082 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA322911 AA322961 H60g80 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 A1922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
A103900B A1B29449 AA583503 AI635674 AW131665 AI473820 AW273118 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 A1608761 AI082748 AI91 1682 A1248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 A1470204 AI188296 AI421367 AI125315 AI087141 AA629032 AA74D589 AI554181 
AA150830 AI248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
A!205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 A181 1 102 AI81 1116 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 A1423963 AW084401 F04963 R69858 H67097 AI917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H1 1617 N79982 AI174338 AI904207 A;904208 BE614558 
W94127 W65436 AI272249 AA700018 A1579932 AI085941 AW152629 

AA334551 BE008229 AA307537 AW961156 AW995894 AW995826 NM_006751 M61 199 AA045603 AL036372 AV645606 
AI688095 AW35ig01 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE0g0412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AWQ23134 AW994986 AW994989 K090429 R23614 A1567932 H03726 
H01101 H01867AA548743 AI671806 AW872949AWB72941 AA742447 Ali99788 AA045604 AI637465 AI741796 
AW242217 AW131463 AI765302 AI683923 AA889762 AI804889 AI986437 C06049 BE5Q2340 Af695851 Ai491970 
AA496804 AA281008 AA665699 AI473814 BE301445 AA707837 AA551925 AI017348 AI208185 AA775203 AA1S6296 
AA557463 H95441 AA768547 AW769358 AA991197 AA181954 AI091389 AI147289AW771837AI638582 AA844411 
A1374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01102 R67868 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA15B452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281174 AA428908 
AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE16B94S AA809054 AW238038 BE011212 BE011359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
AA404418AI217248 

AA353093 AW957317 AW872498 AI560785 AI2891 10 AW135512 X97261 T68873 

AI743860 rW9543 AW027759 BE349467 Ai656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 Ai969290 AI928016 BE394912 BE504220 BE467505 AI61161 1 AI611407 AI611452 W56437 AI284566 
AI583349 AW183058 A1308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 A1984994 AI539335 AA401550 AA358757 A1609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 At580502 AI832944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R28554 
genbanK.AA608588 AA608588 
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123533 genbanK.AA608751 AA608751 
125091 genbanluT91518 T91518 
123964 genbanleC13981 C13961 
102491 entfez^USIOlO U51010 
S 118475 genbanK.N66845 N66845 
118581 genbanleN68905 N68905 
113947 genbanK.W84768W84768 
101447 entre?.M21305 M21305 

101667 13349 1 NM_005381 1^^0858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA187806 AW38G946 

1 0 AW374207 T05235 AA216203 AW385556 AA308940 AA308526 AA315461 AL036757 AW373711 AW403124 AW40364O 

AW377084 T27360 H62638 F08957 AW377051 AA554779 AA378568 AA0g6007 AW352407 AW302637 F07929 H17433 
AW382712 H0S665 F07292 N39875 AA089729 K62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF1 14096 AW375993 AI133569 W52561 AA603040 AA133710 AI928796 AW176370 AA827519 
AW338437 AA521 142 T29341 AIB00461 AW317002 AA703914 AA860830 AI859203 AI445772 AA714334 AI817066 

1 5 AI832027 AW510442 AI635802 AW088306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 

AA074040 AA931657 A1051154 AW410203 A1921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW0800B2 AW383778 AI933053 AI919097 W31557 N90245 AA931591 AA563995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA569119 AA508573 AI809050 AI378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 A1270128 AI472365 AA411363 AI523216 

20 AI719965 AI816302 AA182681 AI707990 AA133588 AI758537 W60253 AI460308 AA135423 AI083904 F04188 N89693 

AW408776 AI678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA962015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 

25 • AW351940 AW373683 AI940524 AW374953 T55500 N24329 AI940720 AW374933 AW374947 AW391913 AL138337 

AW375241 AW062943 F26666 AW410202 AW062958 F34529 AW3B1807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 At745874 AW300951 AI18d579 AI438973 AI305271 AA433818 AA612807 At831809 AI940409 
AA158663A1572988 
108931 genbanK^AA147186 AA147186 

30 103138 entFe^X65965 X65965 
103432 entre2LX97748 X97748 
119174 genbanK.R71234 R71234 

133678 11235J AW247252 AA346143 NM.000270AA381085 N91995X00737 AA381079 AA296473AA296110AA315735 AA311617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99S27 H37842 

3 S W20095 AA457308 AW469547 AA724143 H83220 AA319496 W88334 W30892 R89169 R99427 N41854 H47286 

AA348094 AA045089 R63016 At922219 AK)24g08 AI096488 AI885005 AA194872 N90489 A1452544 H7241 1 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N30611 
A1979189 AI280889 AW273191 R66531 A1285845 A1675927 AW21990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 AA548276 A1149466 AA772669 AA694388 

40 AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW511745 A1500697 AA814922 AA835040 

T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970746 AA972548 
AI3771 16 R62962 H16737 R89070 AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 N63851 AI674743 AA457307 AA21 1475 N64444 Ar799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 A1478666 T27651 

45 AI699770 AW025156 H69719 AI984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 A1336002 

N70898 AI420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781608 AA329050 AA339985 R68964 H64795 W04186 H16845 
1 19416 genbanK.T97186 T97186 
119559 NOT_FOUND^entre:^W38197 W38197 

50 123473 genbanK_AA599143 AA599143 



160 



;NSDOCID: <WO ' 02079492A2 I > 



wo 02/079492 



PCT/US02/04915 



10 



TABLES; 

Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 
ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 
Unigene Title: Unigene gene title 



Pkey Accession ExAccn UnlGene UnigeneTitle 



15 



20 



25 



30 



115819 
132837 
101545 
102898 
101192 
102915 
105330 
107385 
102024 
134416 
103036 
104865 
106124 
105330 
109001 
104764 
133200 
105263 
105178 
109456 



AA426573 

D58024 

M31210 

X06256 

1-20859 

X07820 

AA234743 

U97519 

U03877 

M28882 

X54925 

AA045136 

AA423987 

AA234743 

AA156125 

AA025351 

AA432248 

AA227926 

AA1 87490 

AA232645 



AA486620 Hs.41135 
AA370362 Hs.57958 
BE246154 Hs.1 54210 
NM_002205Hs.149809 
BE247295 Hs.78452 
X07820 Hs^258 
AW338625 Hs.22120 
NI\/l_005397Hs,16426 
AA301867 Hs.76224 



X68264 
M13509 
T79340 
H93366 



Hs^11579 
Ks.83169 
Hs^75 
Hs.7567 



AW338625 Hs^2120 
AI066548 Hs.72116 
AI039243 Hs^78585 
AB037715 Hs.183639 
AW388633 Hs.6682 
AA313825 Hs.21941 
AW956580 Hs.42699 



AA486620 

AA370362 

BE246154 

NM_002205 

BE247295 

X07820 

AW338625 

NM.005397 

AA301867 

X68264 

M13509 

T79340 

H93366 

AW338625 

AI056548 

AI039243 

AB037715 

AW388633 

AA313825 

AW956580 
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TABLE 5A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene duster number from which the oligonucleotides were designed. Gene dusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist. Oakland California). The Genbank accession numbeis for sequences comprising each duster are listed In the "Accession" 
Golumn. 



Pkey: Unkjue Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 
115819 



CAT Number 
10241J 



102024 14505 J 



101545 
109456 
103036 



24607.1 

180633J 

17145J 



133200 28960.1 



132837 256666 1 



102898 24023 1 



Accession 

AA486620 AF205940 AA297524 AB034695 AA081335 NM.016242 AA188323 AA297537 H88204 AW953081 W31695 
AW582203 AA248250 AW68121 1 AA426230 AA464807 AA426155 N44141 AA347390 AA770661 Ai333225 N36136 
AW665724 AA431694 AI374976 AI400254 AI338446 AA186e95 H88205 WQ4527 AA487086 Ai051414 AA918383 
AA426573 AA425620 AW438654 AA090513 BE167284 BE167291 AI301726 

AA301867 AW957981 R27614 AA1 55808 A1920990Ai740711 AA301026 AA301015 At220981 Ai857670AI537140 
AW015210 AA030000 W46890 H44021 AI355967 AI651735 AA058479 AA146932 T58265 R85890 AA047810 AA017387 
AW026093 AA971133 AI827263 AI056416 AI355994 AI127691 H46603 U03877 NM_004105 AA157357 H42844 
AA146824 AA187709 AA187269 AA304348 AA147292 AA361687 AA156041 AA330636 R32929 AA321130 AW950260 
AA082157 AA029129 AA303708 AAa28155 D31561 T84689 AA302493 BE153057 BE153181 W39408 AA187200 
BE153250 AW383337 AW382622 AW382647 AW750072 BE153060 AW382630 AW371865 AW392464 AW382664 
AW382658 AW382650 H61647 AW365075 AW365049 AA373397 BE072779 BE072781 Z30254 W24381 BE153254 
AA040442 BE072729 BE072731 N94740 AA146945 AW802737 AI826799 AI085395 R34034 H65140 AA082800 H88275 
AA147824 R63882 W80899 AA296413 AI7653aO AI862426 AW022055 AW300003 AI743784 AI862635 AI985428 
AA147764 AW573245 AW190290 AI040898 D57613 N63457 AA148082 AI02B458 AA1481 10 AW814489 N75105 
AW629443 AA704122 AW582220 AA181240 AA057495 A1418224 AI261751 AW388595 AI472205 AW470672 AA102546 
AA789046 AA182416 AA062668 AW300732 AI288220 AA181982 AA146825 AA028130 AI985522 AA303344 AA081313 
N69082 AA182035 AI867128 AA100902 AA605087 N67178 AW020324 AW890446 A1472191 AI335691 AI597837 
AI081 143 AI335681 AA040443 AI128067 AI678244 AA018303 AA157260 W80792 AI934590 AI096430 T54343 AI446350 
AA165196 AA780683 AA603631 AA047787 AA968580 AA912645 AW890504 AW026913 D56983 H52088 AA156121 
R30848 AW023036 AI590960 N67345 AI753225 A1753283 AI183768 AA147818 H89101 AI362141 H89205 AI14771 1 
AA321 129 AA668622 AA343479 AW069438 AI422376 AW629270 AA013413 A1221948 AA970605 N52335 H38366 
T91180 AA657841 AA0173e6 AA152227 AA187593 AI913340 AI719313 A1989943 AI701271 A1004328 A1868348 N93659 
H65093 H25736 D57007 D56957 C00987 D61839 D56661 AI472137 AI971002 1:>56971 BE048830 D57972 AI589286 
AI361055 AI361071 AI292223 AA155898 D57139 D57981 D57345 AI420034 D57332 D57959 AA875933 R33493 N67558 
D58353 AA188394 AA147966 AI160640 AI363165 H40638 AA57ei37 AW950265 AA300943 AI128999 H46584 AA917355 
N57820 AA320504 H51959 H25737 

BE246154 1^1210 NM.001400 AA193392 NR^016537 AF233365 AF022137 H27787 AA370448 F05373 T27666 W21494 
AA036907 AI249g6i6 N93476 F01623 AA304390 AA308808 

AW956580 AA886361 AI147670 AI0901 15 AI16B683 AA232645 H99504 AA374707 AA380875 AW139567 AI735132 
BE439385 AW629780 N28322 AA232789 AA232790 N73285 

M135a9 X54g25 NM^002421 M16567 X05231 M15996 W39354 AA186634 AA852324 AA187507 AA081 149 AA166524 
AA187264 AA187361 AA386155 AA186973 AA374217 U78045 AA081230 AA188049 AA186393 W56827 AA852602 
AA157468 AA3D8204 AA186754 AA186808 AAD82516 AA304334 AW376428 BE439384 AW376420 AA156273T18504 
AA186521 W49496 AW084608 AA083575 AA372360 AW963590 AA132297 W47445 AA186376 AA157628 AW003999 
AID37890 AI858080 AI589010 AI743739 AI452673 AW304188 AW1 17854 BE439933 AA157416 AW778966 AI038497 
AA081006 AA100829 AA181048 C02231 T27821 VV23960 AW954802 AI471432 AW801296 AW801289 AW8016D3 
AW801523 AW801292 AW801542 AW801601 AA181134 AI445147 AA191501 AA582862 N94407AI147810AA1 81880 
W49497 W52714 AA188249 AI932881 AI082493 AA503656 AA182682 AW801393 AA182830 AA181882 AA182826 
AI613182 N94510 W47343 AI085755 AI076956 A1918426 AA081208 AI282835 AA147528 AI081490 AI654536 AA181875 
AA081282 AA186389 C06085 AA083542 AI800644 AA157642 AA101069 AA157752 AA158121 AA143331 AA081283 
AA8526D3 AA188296 A1932880 AW449628 AA187348 C02091 AA514656 AA082736 AA308786 AA143201 M16567 
AB037715 AI351347 A1375796 AI884765 AL121124 W01068 AI807275 T95240 R42807 AW516645 AI057314 AI033520 
AA057671 N70215 AA054215 AW204183 AA552149 T95130 AW796310 AI866520 AW275564 AW796308 A1637901 
AW197404 T78408 AA456232 AW206463 AA779800 AI052695 AA026744 AA454623 AW470729 R45490 AW770258 
AI038393 AI290170 AA722734 AL121 125 R41608 AI862414 AA83861 1 R45582 AI278083 BE466849 BE219944 
AA418030 BE041555 AA578572 T16528 AW006344 Z39782 A1244848 AW137344 AA707400 AI032028 BE540464 
AI094265 AI184281 AA931890 AW382744 AV\/382729 AW02D448 AW827237 AA431226 AI672059 AW772345 N70172 
AWQ22003 AI862704 H19344 R6151 1 AI080204 H16566 AA432248 AI767980 T16688 A1984342 A1217478 AI767095 
Z38551 AI359566 AI361437 AI041000 R07033 H16608 H19054 R12874 R61567 N9a368 BE221 199 Z42320 AA094554 
R07078 AW860886 AA418090 R41262 

AA370362 AA3641 10 AW959554 AW371737 AW382068 AW604716 AW604713 AA487827 AW371674 AA429137 
6E503321 T93570 W72803 AI093076 AA487977 A1241562 BE439445 AW204065 R51635 AI802994 T10362 W68553 
Ai866215 AW152154 AA700716 AI127443 R15824 Ai537587 AA953110 D58024 AI520811 AA693670 AI453280 W76329 
AW023955AW022563 

NM.002205 X06256 M13918 BE070866 AW239485 AW996127 BE273894 BE272590 BE41Q252 R2S975 T11766 T11787 
AA301142 AA301 165 AW960506 BE272819 AA386086 T39391 AA285303 AA370580 D5858S T58668 AA156213 W24142 
AA343323 AW798067 AA151 197 AA376121 R94782 AA302363 Hg0357 R82621 AA301677 H55997 AW796059 W92358 
AL046458 AA471198 AA301952 R46287 R82694 K03186 AA187706 R32562 R27094 R25947 R253a) AW949809 H13505 
H79049 R32403 H11213 R39710 H49765 H21142 H21006 AA417664 W52075 N56771 AA284240 N98556 N309O7 
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AA707335 AW603781 AI340367 AI814584 AA524182 AA370076 AA418785 AA704082 AI806851 H25513 T56388 
AA419627 H03986 H20963 T56245 AI459715 AW973768 AI334096 AI693020 T63414 R82646 AW167251 H5S998 
AI274916 AA778367 AI755253 Ai033667 AW083222 AA181979 R26865 AA661627 AA706329 At798646 AA612799 
A1160180 A1274973 A1039264 AA301880 AI042429 AA307632 A!085688 AI278366 AI498890 AA303865 AI954844 
AA502380 AA156334 AA723480 AI803584 AI581026 AA304584 N51038 R94702 R69814 AW160962 AI670049 AA588807 
AA151 198 T53400 AI567709 A11 85326 AA309205 AW338969 R53903 AA991891 AA301643 AW93337 AI026049 H25514 
AI741075 R28632 AW166445 AI333068 H49978 H91267 AA558193 AW079663 AA627380 AA807401 AI199956 AA666118 
AI718216 AW193228 AI077745 AI500496 AI266059 AW080383 R08468 R26757 R32404 AA716599 W92322 A1077734 
A1270181 R46198 AI217540 AA304045 AA305421 AW074445 AI468256 AW089568 AW571605 BE162930 H41009 
AA578313 AW874497 AA181284 AA861947 T29451 D20841 T58618 AA418731 AI2&2500 AW081407AA6045e0 
AA729855 AI262538 A1580225 

102915 2903_2 X07820 Nft/L002425 BE271570 AI263526 AW296143 A1829878 AI973162 AI085155 AA857498 AA709305 C02220 

134416 30694 1 X68264 NM_006500 AF089868 BE257461 BE275425 AW9971 54 AI902799 AI902803 M78206 AAD85691 AW392972 

AA325490 BE006161 AA349269 AA323568 AL042548 AA191148 AA187703 AA322791 AJ297452 T11625 AW366487 
AA303513 AA186961 AA17348a N28330 N28379 W40320 AA1871 18 H03695 AA402709 BE407476 H06354 BE276589 
AA351284 AA379921 AL1 38080 BE410587 AA1 13094 AA340481 BE277483 R21191 R79518 N86170 AA320505 
AA296065 AW951900 AA658897 AA65C052 AA654304 AA191691 N26649 AW080963 Ai265800 N72019 AI453458 
AA092563 AA402310 AI439450 AI061054 AA302358 T71566 AA302047 AA303432 N21289 H27357 AA303504 A1174583 
AW151762 AA181958 AW880618 AA630773 AI889539 AW901058 AI373405 AA341941 AA086217 A1675590 A1653936 
AA633570 AA987619 A1270656 N93847 N40689 AW517517 N20030 W95985 AA303955 H89170 AA309917 N21642 
AA373132 W38517 AI687806 W76182 AA101065 AA036916 N45635 A(744510 AI669803 A1039157 AI126355 AA634607 
AW131 120 AW1 96838 AA190601 AA91 1 130 BE221320 N92355 AA036752 H03696 AA588873 A1458868 AI041818 
AA090477 A1093248 AA304755 AL137942 AL044688 A)083709 AI150965 N88891 AA635675 AA594898 W94657 
AA182823 AW166205 F27886 R79246 F37329 AA565697 AID75739 AI088654 At094287 AI204256 AA095203 T93020 
AA688298 AA057324 N23442 AA075411 AA305046 AI031688 AI191503 AA111887 AA112264 N27929 AA187509 
A1375522 AI474006 H062g7 AI826177 N48880 H2d333 AA075490 R22809 W79542 AI055934 AA042901 AA173481 
AA301986 W74531 AI051747 AA187715 A1888888 AA993017 Aia57530 T92954 N8Q227 AW273S95 AI351260 AW170643 
AW292979 AA302605 AA302330 BE349495 AA328602 AA3Q^61 AI470984 AA155943 AA1S5914 
105178 7792.1 AA31382S AW960347 AF223468 NM_016613 AA186345 AA186508 AA081195 AA147972 AA346943 AW961667 

AA187222 AA187207 AW371052 AW449751 AW748803 AW391606 AW371047 AW3710S7 AW371085 AW382895 
AW371092 AW377556 BE010930 AI018882 AA247878 C04398 C05158 F11398 AA188315 H23385 R55086 H15346 
AA029106 AA228114 H17005 F08498 243376 AA095582 AA055186 AA463361 R15218 AA299132 AW103578 W21538 
AA428131 AA187115 AA157197 AA157167 AW371371 AA363562 AW965995 N55663 217878 AA228023 AI140342 
AA100927 AA496988 AA055917 AI089303 AW014967 AW090248 AW338371 AW131066 D62963 D79713 A1583950 
A)336781 Ai500705 AI471485 AW090239 D79784 D61847 D62789 D61842 AI086327 AI273381 D61815 D63043 AI913548 
AI280560 AI610828 AA029995 C16343 C16513 AI075741 AW516308 AI804764 AA948068 AI356688 AW103452 
AW573063 239445 C16489 A1949870 F04712 AA147823 AW026284 AI151538 AA081303 AA613890 AI251865 AW088499 
AA992111 A1862091 AI373465 BE502094 AI922270 AA884288 AA157079 N56963 AW189145 AA428080 R55056 
AA884068 AW771716 AA186662 C16364 H15723 AI921181 AA156888 H17006 AA1 87490 AI400994AA346942 H28533 
AW129047 R41656 H14636 AA995041 D58370 Z21131 D58186 A1383271 AA643977 D58044 AI934302 AW779425 
F09065 H14930 AA890693 H23274 

105263 178672_2 AW388633 AW378440 AW388283 AW388339 AW388333 AW388414 AW388413 AW388607 AW388453 AW388687 
AW388480 AW388591 AW38871 1 AW38851 1 AW388438 AW388570 AW388449 AI694383 AW237145 AI652991 
AI964041 AW366319 AW366321 AW961938 AW469211 AI634155 AI492186 AI624430 A1677965 N26502 AI963871 
AW378431 AW378421 AI015391 AW352126 N59336 AI352317 AW1971 13 N67998 AW778935 AI476054 AI206626 
R371 16 R4021 1 AA227926 AA639698 R38073 A1001745 T32854 A1619649 AI423703 F10774 AW388616 T16595 H05894 

105330 182497.1 AW338625 R43226 R51640 A)307645 A1308100 AI085787 AI4203S7 AI692610 AA877160 AI953366 AA234743 

104764 90967.1 AI039243 R68234 AA025351 AA971083 AI537757 AA025362 R81636 T86650 

104865 102037J T79340AI742317 AW1 82676 AW4514«)AM20964 R43284 AA088179AW590886 AW269529AA045187 AI521736 
A1827455 AA045136 AW271709 AI004344 AA639631 AA744417 AA744218 AA045357 AA045351 

106124 54542.1 H93366 AI653547 AA336265 AW966175 BE566451 R71178 At630656AA234331 N55039 AA305632AW960431 R34044 

R32254 AW020970 AW451281 AW275041 At636933 AI655640 AA423986 AA642466 Ai684063 AI633B76 AI624897 
AA814795 AW590328 AI889166 AW243541 AI439691 AW473445 AI475516 AA741228 A1127534 AA165143 AI074714 
AI654076 AA400674 AI560249 N50709 AW438621 AI806810 AI434579 AI308184 AA423987 A1141272 AI565586 
AI338440 AA219628 AI246643 AI985809 AA724260 AA633988 AI364172 AI798439 AI650801 R33503 A1435891 
AA903649 T96161 AA665538 AA219620 A1309962 AA400707 BE247066 R32178 AI275962 AA661602 AW003197 
6E466649 AA831198 AI620052 AI825387 AI634037 A)670978 Ai670979 AI655092 R32304 AA828858 AI382428 
AW023660 AA262892 T26891 AW089917 T26926 R32227 

107385 6976.1 NM_005397 U97519 AW899329 A1902387 AA077792 AA078525 AW376607 AA077946 AA070415 BE208721 AW167958 

BE293050 BE208240 AI648698 AA101314 BE393348 BE305122 AA077591 6E274036 AA313687 BE392220 BE378954 
AA171461 AA464821 AW938242 AW938224 AW93a243 AW938232AA147953 N64294 AA205218AW305065 AW517478 
AA307983 AA377023 BE563629 R99976 NB0294 T87719 T87928 AA496849 AA485344 AA204938 AW370448 AA318242 
AW964384 H92423 W95317 BE378774 BE391 1 56 AA349138AA1 73095 AW51 3198 AA037672AA148029 AA169726 
W04791 AA075508 BE382937 BE395034 AF139793 AA961734 N48612 H64714 AW151251 AI565113 AI566881 
AW087370 AA631168 AA622014 AW513098 AI857810 AW152287 AI052596 AI983246 AA024856 A1912456 AI677938 
AW026403 AA972537 AK)88497 AW999869 W94582 AI140166 AI160659 AI566868 AA101263 AW190390 AW166466 
A1401207 AI418158 AI625265 AI146298 AW008592 BE223020 N58926 A1308797 AA037673 Ai935992 AI3a4706 
AA024939 AI216589 AI610423 AI354621 AI500677 A1679389 A[7993 10 N64508 AI128756 AI679897 AW589535 
AA989333 A1500527 AA565479 AA913529 A3923295 F21691 AA989376 A)699064 AA902447 AI690910 AA772659 
AA204983 AI337895 R99975 H65205 AA340766 AI339441 AI913855 AA450293 AW192010 AA070416 N72401 AI371481 
Ai247108 AI371261 AI364987 AI280171 AI269104 At868756 AA9Q9836 AA983640 At973271 AA913092 AI868205 
AI1441 12 A1190975 N58085 AI56663& N93405 AW150504 AW298846 AI687036 AA9Q2984 Al^4460 Al^047 AA6S3148 
A161 1228 AW131922 AA862687 AA902519 C01732 AW796045 AL044660 

101192 15367 1 BE247295 AW068092 Alj041313 AA159244 NM.005415 L20B59 AL135570 W47073 AW516906 BE388271 BE408629 

W46972 BE293846 BE256647 AI075010 AL041095 AA28S300 AIj039S60 AA368740 W26G02 AA399344 AA039235 
W27631 AW834898 AW834914 Rg3390 AA378039 AV649660 T53674 N98824 AA399974 AW843378 AA368267 R082S6 
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AV653575 R27900 N48215 AW366371 N45500 AV652967 AI889251 AI080457 N3S021 AI738542 AW242849 AI857471 
A1859775 A1582a30 R75850 N66564.AW341636 AI49d006 AI887217 AW026894 AW182840 AA039313 AA831346 
AI393465 AW089210 Af743830 AA744243 AA401310 AW439758 AW088152 R93391 AA291379 AA225220 AW009358 
AI192879 AA291202 AI565089 AA225089 AA807688 A1052058 AI341641 AI086625 AA333864 AA159147 AI923912 
R75851 AI761143 AW768588 AA394195 A1288450 AW512564 AI4S2775 AI056520 AA468602 AA872566 AI434739 
AA291838 AI948623 AW768614 AI374753 AW068174 AA884908 AI199346 AI199347 W94946 AI159995 AA877642 
A1280846 AI307610 AA403310 R08205 AW182123 AI000999 R27808 AWQ26571 D20816 A1560350 T27667 AW960271 
At174628 AI432042 AI424528 AA909562 T17342 AI783866 
109001 146370.3 AI056548 AW409843 AW263540 AA723669 AA909334 AA156120 AA157141 AA156125 AW409866 W19499 AA157229 
AW887435 
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TABLE 6: 



10 



Pkey: Unique Eos prabeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

AUC1 : 70«h percentile of average intensity (Al) for prabeset at each of 2,6,15,24.48. and 96 hour timepoints minus 70«« peioentile Al at 0 hrs, 
summed over 5 experiments. 

AUC2: AUC 1/90>h percentile of Al for aorta, aortic valve, vein, and arteiy. 



Ptey Ex^ccn UnigenelD UnigeneTitle 
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314941 
327414 
321911 
331578 
332466 
313513 
320635 
326230 
313566 
313665 
324852 
314372 
311877 
322262 
312173 
319795 
313350 
326759 
300318 
313978 
306840 
310272 
315044 
321325 
303251 
302378 
315060 
332048 
337214 
311598 
304782 
312802 
302680 
317452 
318558 
312149 
319267 
321510 
326198 
315730 
310442 
331237 
300469 
338316 
330968 
331019 
331261 
301822 
325544 
328700 
322882 
336034 
316580 
309931 
330692 
319962 
338033 
314943 
332640 
338158 
327036 



AA515g02 Hs.130650 

AF026944 Hs.293797 
AI2464d2 Hs.24gg89 
AB018259 Hs.1 18140 
AW298600 Hs.141840 
N50617 Hs.60506 



AA628517 
AW751201 
AI380792 
AL040176 

AA084248 
AA632012 
AI821409 
AB037821 
AW591949 

AW444502 

AI870175 

AI077477 

AF216389 

BE547674 

AB033100 

AF240635 

AL109712 

AA551104 

AW337575 

AW023595 

AA582081 

AA644669 

AW192334 

AA972965 

AW402677 

T90309 

F11802 

H75391 



Hs.118502 

Hs.120932 

Hs.135104 

Hs,142003 

Hs.85339 

Hs.188746 

Hs.304471 

Hs.146858 

Hs.57958 

HS.2569B2 

Hs.13957 

Hs.307912 

Hs.148932 

Hs.204169 

Hs.300646 

Hs.1 15897 

Hs.296506 

Hs.189048 

Hs^1591 

Hs.232048 

Hs.193042 

Hs.38218 

Hs,135568 

Hs.146381 

Hs.269651 

Hs.6818 

Hs.255748 



ESTs 

predicted exon 

ESTs 

ESTs 

KIAA0716 gene produd 
ESTs. Weakly similar to S59501 tnterfero 
smalt nuclear ribonudeoprotein polypept 
predicted exon 

ESTs 
ESTs 

ESTs. Weakly sMarlo The KIAA0149 gen 

G protein-coupled receptor 39 

ESTs 

ESTs, Highly similartoAF116865 1 hedge 
protocadherin 10 
ETL protein 
predicted exon 

ESTs. Highly similar to AF1 16865 1 hedge 

ESTs 

EST 

semaphorin Rs, short form 
ESTs 

KIAA protein (simitar to mouse paladm) 
protocadherin 12 

Homo sapiens mRNA full length insert cON 
ESTs. Moderately similar to ALUC.HUMAN 
ESTs 

predicted exon 
ESTs 



AUC1 

1038 

303.2 

429.2 

677.4 

395.2 

324 

394.8 

357.2 

433.6 

-83 

348.2 

-49.2 

-1309 

-247.8 

-1025.8 

203.6 

183.8 

1654.4 

'346 

576.6 

56.4 

-127.6 

-102.6 

1080.6 

1270.8 

915.8 

1236.8 

522.6 

269 

796.4 



H25899 Hs.201591 
AW072215 Hs.208470 
W87874 Hs,25277 
8E301708 HS.2339S5 

R44557 Hs.23748 
NM_006033Hs.65370 
BE539976 Hs,103305 
X17033 Hs.271986 



AW248508 Hs.279727 

AA938198 Hs.146123 
AW341683 
R39288 Hs.6702 
H06350 Hs.135056 

Y00272 Hs.184572 
BE568452 Hs.5101 



gb:nn32h08.s1 NCl_CGAP.Gas1 Homo sapiens 
ESTs 349.6 
ESTs 638.6 
ESTs 360.8 
RNA binding motif protein. X chromosome 700.2 
ESTs 274.2 
ESTs 238.2 
ESTs 231.8 
predicted exon 581.6 
ESTs 281.6 
ESTs -213 
hypothetical protein FU21065 285 
hypothetical protein FU20401 26.6 
predicted exon 14942 
ESTs 975.8 
lipase, endothelial 201 .2 

Homo sapiens mRNA; cDNA DKFZp434B0425 (f 
Integrin, alpha 2 (CD49B. alpha 2 subuni 356.2 
predicted exon 1014.6 
predicted exon 627.4 
Homo sapiens cDNAFU14035fis. done HE 84.8 
predkited exon 782.6 
hypothetical protein FU12972 746.4 
gb:hd13d01jc1 Soares.NFU.T.6BC,S1 Homos 
ESTs 137 
Human DNA sequence from done RP5^E9 



predk:ted exon 540.6 

cell drvisfon cyde 2. G1 to S and G2 to -494.8 

protein regulator of cytokinesis 1 -600 

predicted exon 311.2 

predated exon 351.8 



AUC2 

9 

30.3 

42.9 

10.3 

39.5 

32.4 

39.5 

36.7 

12 

0.5 

34.8 

0.5 

0.2 

1 

1 

5.2 
18.4 
1.2 
1 

2.3 
0.4 
0 
0 

4.8 

5.3 

15.8 

4.9 

4.7 

26.9 

20.2 

316.410.5 

7.6 

63.9 

36.1 

6.6 

7.5 

23.8 

23.2 

8.2 

9.7 

0.3 

0.5 

0.3 

34.7 

1.8 

0.9 

478.61.3 

1,7 

9.4 

62.7 

5.7 

783 

13.8 

134 J 13.5 
13.7 

14.6 0.5 
14 
1 
1 

31.1 
ZS2 
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302855 AJ227892 Hs.146274 ESTs 180^ 
327568 predlctBdexon 229 

324801 AW770553 K5.14553 sterol 0-acyttransferase (acyUk)enzyin8 161^ 
317850 A1681545 Ks.152g82 hypolheticat protein FU131 17 -GdO 
322818 AW043782 Hs.293616 ESTs 126.4 
324626 AI585464 Hs.292638 ESTs 170.2 
317224 X73608 Hs.93029 sparc/osteonectin.cwcvandkazal-IIked -80 
310955 AI476732 Hs.263912 ESTs 466.8 
315240 R38772 Hs.172619 KIAA1 106 protein 277 
338386 predicted exon 267.6 

338442 predicted exon 256 

318617 AW247252 Hs.75514 nucleoside phosphorylase 1247.8 
338645 predicted exon 206 

313135 N58907 Hs.162430 ESTs 204.8 
324716 BE169746 Hs.12504 hypothetical protein DKFZp761ID081 203.6 
330305 predicted exon 199.8 

308248 AI560919 gb:tq41g10j(1 NCLCGAP_Ut1 Homo sapiens 

308886 A1833240 gb3t76d10jc1 Barstead colon HPLRB7 Homo 198.2 

315622 AI796144 Hs.258188 Homo sapiens cDNAFUl 1674 fis, done HE 191^ 
323675 R43240 Hs.2721 68 tumor differentially expressed 1 189.2 
312164 T91980 Hs.221074 ESTs 187.6 
300378 Z45270 Hs.235873 hypothetical protein FU22672 271.6 
317478 Af343569 Hs.107000 Homo sapiens mRNA for WDC146. complete c 
317559 AW452344 Hs.l29977 ESTs 184.2 
317207 A1873346 Hs.214505 ESTs 182.8 
334834 predicted exon 178.8 

320925 D62892 gbi1UM337C07B Clontech human aorta poiyA 

303289 AL121460 Hs.272673 hypothetical protein FU20508 316.4 
328548 predicted exon 174.6 

317108 AA884000 Hs.8173 hypothetical protein FLJ10803 172.4 
318013 At188183 Hs.144078 ESTs 326 
314299 AW382682 Hs.154840 ESTs 170.8 
317702 AW173339 Hs.135665 ESTs 169.8 
316094 AW975920 Hs.283361 ESTs 169.4 
323706 AA377578 Hs.65234 hypothetical protein fU20596 169.2 
325843 predicted exon 321.4 

316012 AA764950 HS.11989B ESTs 1047.2 
309687 AW236154 Hs.77385 myosin,Bghtpolypeptide6,all«li.smoothmu 168.2 
323329 At134744 Hs.10852 ESTs 168 
312853 W05086 Hs.1 14256 ESTs 167.4 
313070 AI422023 Hs.161338 ESTs 298.6 
314096 AW977642 Hs.291742 ESTs 165.6 
338728 predicted exon 165.4 

316609 AW292520 Hs.122082 ESTs 165 
305989 AA888220 gb:oj15h01.s1 NCI.CGAP Kid5 Homo sapiens 

312642 AW052128 gbwx26c02.x1 NCLCGAP.Kid1 1 Homo saplen 

339236 predicted exon 163.6 

317058 AI217713 Hs.147586 ESTs 161.8 
311137 AW207582 Hs.196042 ESTs 582.2 
310178 AI936450 Hs.147482 ESTs 161.2 
320745 H51696 Hs.89278 hypothetical protein FU1 1186 161 
317336 AW014637 Hs.130212 ESTs 160 
309871 AW300366 gb3(s63b05j(1 NCLCGAP.Kidll Homosapien 

302038 AC004076 Hs.129709 Homo sapiens diromosome 19, oosmidR3021 
332237 N52883 Hs.102676 EST 159 
312362 AW015994 gb:Um-BI0p4bh-g-09^-Ul.s1 NCLCGAP.S 158.6 

331558 N62401 Hs.48531 EST 158.6 
316215 AI684535 Hs.200811 ESTs 158.4 
336059 predicted exon 157.4 

302790 AJ245245 gb:Homo sapiens mRNA for immunogtobuOn 155.8 

predicted exon 

hypothetical proteh FLJ20142 
ESTs 

predated exon 
Hs.1 30264 ESTs 

predicted exon 



328418 

304229 AK000149 Hs.29493 
331606 AW273285 Hs.50802 
338962 

317959 A1204202 
336228 

313534 AW072916 Hs.78743 zinc finger protein 131 (done pHZ-10) 

317404 AI806867 Hs.126594 ESTs 

311943 AI469911 Hs26498 hypothetical protein FU21 657 

314680 AI247425 Hs.152182 ESTs 

331484 N29696 Hs.44076 EST 

338116 predicted exon 

329863 predicted exon 

315555 AW452886 Hs.239107 ESTs 

317039 AA868583 Hs.126153 ESTs 

331138 R63816 Hs.28445 ESTs 



153.8 
153.6 
153 
664.4 
152.6 
152.4 
152.2 
152.2 
152 
151.4 
151.2 
151.2 
150.6 
149.6 
149.6 
149.6 



18 
22^ 
16.1 
1 

4.5 
17 
0 

46.7 
27.7 
26.8 
25.6 
24.2 
20.6 
20.5 
20.4 
20 

1994 19.9 

19.8 

19.1 

18.9 

18.8 

18.7 

187 18.7 
184 
18.3 
17.9 

177.2 17.7 

17.6 

17.5 

17.2 

17.2 

17.1 

17 

16.9 

16.9 

16.9 

16.9 

16.8 

16.8 

16.7 

16.6 

16.6 

16.5 

16.5 

164.616.5 
164 16.4 
16.4 
16.2 . 
16.2 
16.1 
16.1 
16 

159.816 

159 15.9 

15.9 

15.9 

15.9 

15.8 

157 

15.6 

15.4 

154 

15.3 

153 

15.3 

15.2 

15.2 

15.2 

15.2 

15.1 

15.1 

15.1 

15.1 

15 

15 

15 
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316561 AI917222 Hs.1216S5 ESTs 149.4 14J 

328695 predicted exon 149.2 14.9 

302282 BE396283 Hs.173987 eukaryotic translation initiatton factor 148.4 14.8 

318781 F11802 Hs.6818 ESTs 148.2 14.8 

5 323709 AW297246 Hs.288546 Homo sapiens cDNAaJ14190fis, done NT 148 14^ 

310790 AW192063 Hs.248865 ESTs 147.8 14.8 

316833 AW292614 Hs,124367 ESTs 147.8 14.8 

323176 NM.007350HS.82101 pleckstrin homology-fike domain, family 229 14.8 

324188 AW274439 Hs.252709 ESTs 147,6 14.8 

10 317441 AA922798 Hs.196583 ESTs 147.4 14.7 

317584 AI825890 Hs.220513 ESTs 146.8 14.7 

321798 AI308206 Hs.181959 ESTs 146.8 14.7 

304363 AA206O45 gb:zq77f05,s1 StratagenehNT neuron (937 146.6 14.7 

313952 F20956 gb:HSPD05390HM3 Homo sapiens cDNA done 146.614.7 

15 301909 A1702609 Hs.15713 ESTs 263.8 14.7 

309198 AI9048g5 Hs.9614 nudeophosnun (nudeoiar phospiwprotein 146.2 14.6 

321860 N47474 Hs.212631 ESTs 146.2 14.6 

330187 predicted exon 146 14.6 

323042 AA463571 Hs.172550 polypyrimidlne tract binding protein (he 145.6 14.8 

20 313636 AA262397 Hs.201366 ESTs 145.2 14.5 

302437 AB024729 Hs.227473 UDP-N-acetylg!ucosamine:a-1.3-D-mannosid 145 14^ 

318197 At473096 Hs.133403 ESTs 144.8 14.5 

302749 M16951 gb:Human ig mu-chain mRNA VDJ4-region. 5 144.6 14.5 

322357 AI734258 Hs.245367 ESTs. Weakly similar to ALU1_HUIVIAN ALU S 144.614.5 

25 300391 AI927371 Hs.288839 iiypothetical protein FU1 2178 144.4 14.4 

326077 predicted exon 144.4 14.4 

302004 Y18264 Hs.123094 sal (Drosopliila)-lii(e 1 ' 144 14.4 

320668 AA805666 Hs.146217 Homo sapiens cDNA: FU23077 fis. done L 144 14.4 

331212 T88693 Hs.226410 ESTs 144 14.4 

30 311268 AI969727 Hs.231859 ESTs 143.2 14.3 

305159 AA659166 Hs.275668 EST,WeaklysimilartoEF1D.HUI^ELONGATIONF 143 14.3 

304510 AA457391 Hs.1 19122 ribosomalproteinL13a 142.8 14.3 

320852 AA772920 Hs.303527 ESTs 142.8 14.3 

330854 AW291944 Hs.122139 ESTs 142.8 14.3 

35 318275 AW449952 Hs.190125 basic-helix-loop-helix-PAS protein 142.6 14.3 

314992 AI824879 Hs.211286 ESTs, Wealdy similar to 1207289A reverse 142.2 14.2 

322631 AA001697 Hs.293565 ESTs. Wealdy simitar to putative pi 50 [H 142.2 14.2 

332283 R40855 Hs.100839 EST 142 14.2 

302894 AA719572 Hs.274441 Homo sapiens mRNA; cDN A DKFZp434N011 (fr 141.214.1 

40 301808 R35391 Hs.252831 reticulon3 141 14.1 

318608 AI204491 Hs.l51502 ESTs 141 14.1 

316499 AW292947 Hs.l22872 ESTs 140.8 14.1 

317011 AI248760 Hs.l50276 ESTs 140.8 14.1 

321840 N45600 Hs.46534 Homo sapiens mRNA; cDNA DKFZp434P0714 (f 140.814.1 

45 327365 predicted exon 140.8 14.1 

331264 AA278898 Hs.225979 hypothetical protein similar to small G 140.8 14.1 

324545 AW501944 Hs.127243 Homo sapiens mRNA for KIAA1724 protein. 140.4 14 

312986 AA211586 gb:zn56d05.s1 Stratagene musde 937209 H 140.2 14 

316053 AA825814 Hs.149065 ESTs 140.2 14 

50 330723 BE247449 Hs.31082 hypothetical protein FU10525 140.2 14 

304876 AA595765 gb:nJ28g06.s1 NCLCGAPJkAl Homo sapiens 139.8 14 

311379 AW134766 Hs.202450 ESTs 139.8 14 

318265 AW019873 Hs.146840 ESTs 139.8 14 

324137 AA393127 Hs.222762 ESTs 139.8 14 

55 328262 predidedexon 139.6 14 

322349 AK001279 Hs.180171 Homo sapiens d)NA FU10417 lis, done 1^ 139.4 13.9 

323504 AA280223 Hs.130865 ESTs 139.4 13.9 

304261 AA059387 gb:zf66d01. si Scares retina N2b4HR Homo 139.2 13.9 

310489 AW451493 Hs.235516 hypothetical protein PR02955 139^ 13.9 

60 335946 predidedexon 139.2 13.9 

318155 AI041546 Hs.132133 ESTs 138.8 13.9 

313796 AI797169 Hs.208486 ESTs 138.6 13.9 

333977 predidedexon 138.6 13.9 

324845 AW969535 Hs.283718 ESTs 138.2 13.8 

65 331139 R65706 gb:yi16g12.s1 Scares ptecentaNb2HP Homo 138.2 13.8 

331131 R54797 gb:yg87b07.s1 SoaresinM brain 1NIBH 669.6 13.8 

321250 H58539 Hs.151692 ESTs 138 13.8 

312498 AA66B782 Hs.191284 ESTs. Wealdy similar to ALU1J1UMAN ALU S 137.813.8 

331252 W52470 Hs.34578 atpha2.3-sialyitransferase 137.8 13.8 

70 337407 predidedexon 137.8 13.8 

303973 AW512014 gb3a68a03a1 NCljCGAP.Lym12 Homo sapien 137.413.7 

314582 AA412258 Hs.188817 ESTs 137.4 13.7 

327373 predidedexon 137.2 13.7 

323367 AA234591 Hs304123 ESTs 136.6 13.7 

75 316207 AA832065 Hs.120260 ESTs 136.4 13.6 

315231 AA705809 Hs.119922 ESTs 136.2 13.6 
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318592 


T39310 Hs.1 139 ooM shodc domain protein A 


136^ 


13.6 


320908 AW969708 Hs.293332 ESTs 


136^ 


13£ 


328937 


predicted exon 


136.2 


13.6 


329073 


predicted exon 


136.2 


13.6 


318231 


AV659082 Hs.134228 ESTs 


136 


13.6 


311992 


AI.360200 Hs.114145 ESTs 


135.8 


13.6 


316497 


AA766457 Hs.136849 ESTs 


135.8 


13.6 


317677 AA968594 HsA27eSS ESTs 


135.8 


13.6 


321680 WQ2848 Hs.93704 ESTs 


135.8 


13!6 


JAUoU 


predicted exon 


135.8 


13.6 




AF036943 Hs.172619 KIAA1 106 protein 


135.8 


13.6 




AL134878 Hs.1 19500 ribosomal protein, large P2 


135.6 


13.6 


our «xu 


AI223207 Hs.147888 EST - 


135.6 


13.6 


oi in** 


AW449382 Hs.1 95297 ESTs 


135.6 


13.6 


OZU3/«7 


R15138 Hs.165570 Homo sapiens clone 25052 mRNA sequence 135 


13.5 


oUUZo 


AA884104 Hs.125546 ESTs 


134.8 


13.5 




N5819d H5.162898 ESTs 


134.8 


13.5 




H09604 Hs.13268 ESTs 


134.6 


13.5 


o3277d 


AF241850 Hs.151428 ret finger protein 2 


134.4 


13.4 


332494 


AA282330 Hs.145668 ESTs 


134.2 


134 


334376 


predicted exon 


134.2 


13.4 


013264 


N93416 Hs.1 18228 ESTs 


133.6 


13.4 


040ccn 
313oo9 


AA351109 Hs.5437 Taxi (human T-celUeukemla virus type 1 


133.2 


13.3 


312083 


T87398 Hs.205816 ESTs 


132,6 


13.3 


319354 


AA993807 Hs.167367 ESTs 


132.6 


13.3 


307414 


AI242106 gb:qh92a02j(1 Soares_NFL_T_GBC„S1 Homo s 


132.2 13.2 


312771 


AA018515 Hs.264482 Apg12(autophagy12.S.cefevisiae)-U)(e 


131.8 


13.2 


313004 


AI274963 Hs.145900 ESTs 


131.2 


13.1 


300995 


AW510641 Hs.258018 ESTs 


220.6 


13 


319323 


F12650 Hs.13287 ESTs 


125.4 


125 


329451 


predicted exon 


123.4 


12.3 


337603 


predicted exon 


572 




312480 


R68651 Hs.144997 ESTs 


121.4 




324934 


AW452051 Hs.147546 ESTs 


119.4 


11.9 


320723 


BE178025 Hs.7942 hypothetical protein FU20080 


117 


11 7 

1 l.r 


318188 


AI792566 gb:qI74f02.y5 NCLCGAP.Ov26 Homo sap 
AF238869 Hs.283955 Homo sapiens done GL£H-2 similar to gii 


iens 


116611 7 


320873 


112.8 


11 a 


331005 


BE003191 Hs.1 19555 ESTs 


112.6 


11.3 


304969 


AA614406 gb:np46(05.s1 NCI_CGAP.Br11 Homo sapiens 


112.4112 


319799 


A1139253 Hs.227767 zinc finger protein 41 


111.2 


11.1 


302610 


AA347945 Hs.256024 ESTs 


111 


11.1 


309485 AW130320 Hs.108124 ribosonialproteinS4,X'linked 


111 


11.1 


311880 


AW419225 Hs,256247 ESTs 


110.2 


^\ 


313981 


AW452334 Hs.128148 ESTs 


110.2 


1 1 


322442 


W49701 Hs.29667 ESTs 


109.4 


10 Q 


315099 


AA806536 Hs.291641 ESTs 


109 


10j9 


304793 


AA583264 Hs.182979 ribosomaIproteinL12 


108.8 




330815 


AA019211 Hs.236463 K)AA1238 protein 


108.8 


in^ 


304044 


T81656 Hs.252259 ribosomal protein S3 


714.8 


10^ 

IwM 


325222 


predicted exon 


135 




325889 


predicted exon 


814.6 


10.8 


321447 


AW891130 Hs,38173 ESTs 


107.8 


10.8 


302990 AA498212 Hs.180182 ESTs 


106.2 


10.6 


308106 


AI476803 gb:b77e12j(1 Soares.NSF_F8.9W.OT^PA.P.S 


270 6106 


310536 AI301041 Hs.150174 ESTs 


106 


10.6 


315257 AW157431 Hs.248941 ESTs 


233 


10.6 


318787 242313 Hs.22657 ESTs 


105.8 


10.6 


312306 


AI927226 Hs.175610 ESTs 


105.2 


105 


326788 


predicted exon 


104.4 


104 


312234 AA830640 H5.208934 ESTs 


104 


10.4 


314482 


AW085525 Hs.134182 ESTs 


234 


10.4 


323597 


Ai185693 Hs.135119 ESTs 


m4 


10.2 


302623 AW836724 Hs.194110 hypothetical protein PRO2730 


162.4 


■ 10.2 


323594 


AI791531 Hs.129993 ESTs 


101 


10.1 


324315 


N55761 Hs.194718 zinc finger protein 265 


100.2 


10 


314217 


AA256465 Hs.188725 ESTs 


99.2 


9.9 


320932 


AA554913 Hs.162297 ESTs 


98.2 


9.8 


327876 


predicted exon 


98.2 


9.8 


319736 


R17424 Hs.6650 vacuolar protein sorting 45B (yeast homo 


98 


9.8 


327747 


predicted exon 


97.6 


9.8 


327844 


predicted exon 


97.4 


9.7 


318200 AI081ig2 Hs.166517 ESTs 


97.2 


9.7 


329414 


predicted exon 


97.2 


9.7 


316296 


AI089667 Hs^70713 ESTs 


1214 


.9.7 


307010 Ai140014 gb:qa68fD9J(1 Soar8SJetaLheart.NbHH19W295 


9.7 


319792 AI138635 Hs.22966 ESTs 


385.4 


9.6 
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318804 
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311820 
321614 
330306 
303096 
313275 
302593 
321421 
330832 
311847 
322036 
328668 
325251 
329088 
322524 
337953 
323529 
307041 
318285 
312021 
329350 
326169 
338038 
312549 
312542 
320992 
318596 
315650 
324328 
332622 
328229 
319110 
316133 
303992 
322675 
325753 
312539 
302592 
314578 
335986 
321478 
305192 
304275 
302779 
301976 
316021 
320802 
317282 
316827 
303190 
315587 
333122 
310214 
320069 
309328 
318971 
327220 
315757 
320730 
313339 
318634 
320955 
306605 
309349 
306004 
330020 
302308 
314648 
315131 



A1263059 
F06771 

Z42549 

AA055611 

AW752437 

AW274545 

H86161 

AL080276 

AI027604 

H54855 

BE465115 

AI133530 

AW301807 

BE002723 



Hs.145594 
Hs.27226 

Hs.160893 
Hs.226568 
Hs.325708 
Hs.254333 



Hs.268562 

Hs. 159650 

Hs.36958 

Hs.171688 

Hs.62930 

Hs.297260 

Hs.30ig05 



dUTPpyrophosphatase 
predicted exon 
ESTs 
ESTs 

predicted exon 
ESTs 



96 

93.8 

93.4 

93.4 

922 

92 



W79027 

AA284397 
AI144243 
A1332454 
AA759263 



AI214510 

D60076 

AB026891 

AI470235 

AA649042 

AA447276 

R10674 

T75260 
AI187742 
AW515800 
AA017656 

AI004377 
AA294921 
AA410183 

AW402593 

AA6660ig 

AA070605 

AJ235667 

T97905 

AW2g3399 

BE336699 

A1733112 

At380429 

BE280787 

A1268399 

AI220072 
D43945 
AW024348 
Z44067 

AW014605 



Hs.271762 

Hs.201485 

Hs.158412 
Hs.14041 

Hs.146304 

Hs.225972 
Hs,172698 
Hs.269615 
Hs.292020 
Hs.128856 

Hs.98321 
Hs.125562 

Hs.146580 

Hs.200360 
Hs.250811 
Hs.137475 

Hs.123253 



AI682536 

T49598 

AW820035 

AI000497 

AW051913 

AA889992 

AW327279 
AW979268 
AI753709 



Hs.77256 
Hs.144904 
Hs.185055 
Hs.176101 
Hs.172445 
Hs.16079 
Hs.140489 

Hs.165893 
Hs.1 13274 
Hs.233191 
Hs.10957 

Hs.179872 
Hs,151072 
Hs.163495 
Hs.156832 
Hs.278679 
H5.119500 

H&2186 

Hs.91379 

HS.1K484 



ESTs. Moderately similar to ALU4.HUMAN A 92 

ESTs 91.8 

ESTs 91.4 

gb.7s94t)01 .r1 Soares retina N2b5IHR Homo 91 

predicted exon 91 

regulator of G-protein signalling 17 90 

ESTs 110.4 

ESTs 86 

ESTs 862 

ESTs 456.4 

ESTs 86 

Homo sapiens cDNAFU14080ris, done HE 145.8 

predicted exon 85.6 

predicted exon 85.4 

predicted exon 85.4 

ESTs 84 

predicted exon 451 
Homo sapiens clone FLC0664 PR02866 mRNA, 
gb:qb85b12j(1 SoaresJetaUeait,NbHH19W 

ESTs 81.4 

ESTs 81 

predicted exon 81 

predicted exon 80.4 

predicted exon 10242 

ESTs 77.4 

gb:HUM084E1QA Clontedi iiuman fetal brain 76.8 

solute earner family 7, (cationic amino 76 

EST 150.6 

ESTs 73.4 

ESTs 210.4 

CSR1 protein 702 

predicted exon 69.4 

hypothetical protein 1^14103 68.6 

ESTs 308.6 
gb:hd88g01.x1 NCI_CGAP_GC6 Homo sapiens 

enolase 2, (gamma, neuronal) 377.2 

predicted exon 105.2 

Homo sapiens cDNAFU13027fis. done NT 922 

v-ral simian leuken^ viral oncogene tK>m 361 .6 

ESTs 201.6 

predicted exon 108.6 

hypothetical proteki FU22009 528 

gb:ag44a04.s1 Jia bone manow stroma Hom 56.6 

gb:zm53h09.s1 Stratagene fibroblast {937 78.6 

gb:Homo sapiens mRNA for immunogbbulin 278.8 

enhancer of zeste (DrosophOa) homolog 2 479.2 
nudear receptor co-repressor 1 
BENE protein 
ESTs 
ESTs 

hypothetical protein FU10233 
ESTs 

predicted exon 
ESTs 

transcription factor EC 
EST. Weakly similar to A27217 glucose tr 
ESTs 

predicted exon 
ESTs 
ESTs 

Homo sapiens cDNA FIJI 3608 fis, done PL 
ESTs 

a disintegrin and metalloproteinase doma 
nbosomalprotein,largeP2 
gbnivx24a09jc1 NCLCGAP_Kid11 Homosapien 

euka^otictranslalionetongationfectDrlga 451.2 

predicted exon 612 

ribosomal protein L26 342 
gb£ST391378 ^/IAGE resequences. MAGP Homo 

ESTs 130.4 



792.4 

2423.8 

523.2 

576 

223 

136.2 

399 

234.4 

68 

258.8 

376.6 

47.4 

177.4 

2052 

260 

475.2 

388.6 

.81.6 



9.6 
9.4 
9.3 
9.3 
9.2 
9.2 
9.2 
9.2 
9.1 
9.1 
9.1 
9 

8.8 
8.8 
8.6 
8.6 
8.6 
8.6 
8.6 
8.5 
8.5 
8.4 
8.3 

82.6 8.3 

306.88.2 

8.1 

8.1 

8.1 

8 

7.9 
7.7 
7.7 
7.6 
7.5 
7.3 
7.1 
7 

6.9 
6.9 

6.9 

67.8 6.8 

6.7 

6.6 

6.4 

6.3 

6.1 

6 

6 

5.9 

5.6 

5.5 

5.4 

5.3 

5.3 

5.1 

5.1 

5.1 

5 

5 

4.9 
4.9 
4.8 
4.8 
4.7 
4.7 
4.6 
4.5 
4.5 
4.4 
4.4 

102.44.3 
4.2 
4.1 
3.9 

56.4 3.8 
3.7 
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322221 N24236 
302925 AL137449 



313690 A1493591 Hs.78146 platelet/endofher»l cell adhesion motec 3179.6 

333585 piedictedexon 175.4 

312911 H93366 Hs.7567 Homo sapiens cDNA:FU21962fis, done H 219 

322966 AA633669 Hs.235920 Homo sapiens cell reoognifion molecule C 350.2 

312492 R71072 Hs.191269 ESTs 322.8 

318988 Z44203 Hs.26418 ESTs 25 

332363 AI123705 Hs.106932 ESTs 773.4 

324181 AI025476 Hs.131628 ESTs 634.8 

311717 AW205369 Hs.312830 ESTs 54.2 

321342 AA1 27984 Hs.222024 transcription factor BMAL2 23.4 

308852 AI629848 Hs.1 62937 peptidylprolylisomeFaseA(cyclophilinA) 92 

331465 AA373210 Hs.43047 Homo sapiens cDNA FU 13585 fe. done PL 494 

320279 AB033062 Hs.134970 DKFZP434N 178 protein 76.2 

Hs. 179662 nudeosome assembly protein l-fike 1 253.2 

Hs.126666 homeo box B4 136.6 

331384 AB041035 Hs.93847 NADPH o;(idase 4 720 

300938 AA514416 Hs.152320 ESTs. Weakly similar to 1605244Aeiythro 27 

312695 AW196663 Hs.200242 ESTs 303.8 

320223 W35132 Hs.267442 ESTs 189 

332743 AW247977 Hs.87595 translocase of inner mitochondrial membr 14.4 

Mitochondrial Acyl-CoA Thioesterase 529.8 

predicted exon 396.2 

predicted exon 91.8 

predicted exon 408.4 

313478 AA643008 Hs.192775 ESTs 413.4 

309899 AW338564 Hs.217493 annexinA2 -30.8 
311735 AW294416 Hs.144687 Homo sapiens cDNA FU1 2981 fis. done NT -62.8 

312953 NM_001992Hs.128087 coagulation fador II (thrombin) recepto -73.6 

313055 AW367295 Hs.241175 ESTs -43.8 
313291 AI267970 Hs.150614 ESTs. Weakly similar to ALU4.HUMAN ALUS 

315059 AW275110 Hs.271106 ESTs -67 

322284 AI792140 Hs.49265 ESTs ^95.2 

322450 AL121278 Hs.25144 ESTs -1.6 

324803 AW975183 Hs.292663 ESTs 4.4 

331495 AW970939 Hs.291039 ESTs -282.8 

predided exon -152.6 

predided exon -23.2 

predided exon -331.2 

neurotrophic tyrosine kinase, receptor. 591.2 
Homo sapiens mRNA; cDNA DKFZp564B1162 (f 

135 

predicted exon 727.4 
315343 BE144306 Hs.l79891 ESTs. Weakly similar to P4HA„HUIVIAN PROLY 



331039 AW378585 Hs.18625 
333123 
328455 
334458 



333610 
335093 
339403 

302820 X04588 Hs.85844 

302270 R56151 Hs.93589 

323755 AW300094 Hs.136252 ESTs 

326946 



311168 AK001270 HS.1960B6 hypothetical protein FU 10408 



329732 

321415 BE621807 Hs.3337 
333121 
333120 

330392 AW797956 Hs75748 

314711 AA769365 Hs.126058 ESTs 

330865 BE409857 Hs.69499 hypothetical protein 



predided exon 

transmembrane 4 superfamity member 1 
predicted exon 
predided exon 

proteasome (prosome, macropain) siibunit, 



333169 
335095 
335815 
330232 



predided exon 
predided exon 
predided exon 
predided exon 



304 
109.2 
414.8 
87.8 
379.8 
589.2 
-87 
347.4 
-1182 
106.4 
-156 
102.6 



330823 AA031565 Hs.221255 ESTs, Moderately similar to ALU5_HUMAN A -62 

331704 F04225 Hs.66032 ESTs -14.6 

302642 NM_016428Hs.1307l9 NESH protein 267.6 

304484 AA432067 Hs.258373 ESTs 85 

Hs.144840 homotog of mouse C2PA -70 

Hs.134084 ESTs -195.4 

Hs73742 ribosomalprotein,large,PO -33.4 

Hs.109221 ESTs -392 

predided exon -5.6 

Hs.154140 Homo sapiens ovary-spedftcaddicprote -880.6 

predided exon 55.2 

316342 AA743935 Hs.202329 ESTs 43.4 

331263 AW780192 Hs.267596 ESTs -180.4 

335987 predicted exon -134 

311923 T60843 Hs.189679 ESTs 12.2 

310522 AW134529 Hs.244647 ESTs -187^ 

315363 AA759190 Hs.121454 ESTs. Weakly similar to olfadory recept 80 

302032 NiVL001992Hs.128087 coagulation factor II (thrombin) recepto -877 

313140 BE265133 Hs.217493 annexinA2 95.4 

310860 AW015920 Hs.161359 ESTs -239 
317899 AI952430 Hs.150614 ESTs. Weakly similar to ALU4.HUMAN ALUS 



310230 AK000377 
301531 A1077462 
306337 AA954221 
331327 N46436 
332961 

322796 W31178 
328857 



3.6 
3.5 
3.5 
3.4 
3 

2.5 

2.5 

2.4 

2.4 

2.3 

2.3 

2.3 

2.2 

2,1 

2.1 

1.8 

1.8 

1.6 

1.5 

1.4 

1.4 

1.4 

1.3 

1.3 

1.1 

1 

1 

1 

1 

-63 1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

276.61 

0.9 

0.9 

122.80.9 

0.9 

0.9 

0.7 

0.7 

0.7 

0.7 

0.6 

0.6 

0.6 

0.6 

0.6 

0.6 

0.5 

0.5 

0.5 

0.5 

0.4 

0.4 

0.4 

0.4 

0.4 

0.3 

0.3 

0.3 

0.3 

03 

0.3 

0.3 

0.3 

03 

0.3 

0.3 

-715.2 



0.3 
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328520 predicted exon -109.2 02 

302406 NM.012099HS511956 CD3^psaon-associated protein; antisens 10 0.2 

31 1fi04 AI886921 Hs.203349 Homo sapiens cDNA FU12149 fe. clone MA -252.6 0.2 

315065 AK001122 Hs.105859 hypothetical protein FU10260 ^6.2 02 

5 314129 AA228366 Hs.1 15122 ESTs -308.8 0.2 

335697 predicted exon -47^ 0.2 

335989 predicted exon 89 0.2 
320606 AW867943 Hs.127216 hypothetical protein FU13465 -205.6 0.2 
329745 predicted exon 103 0.2 

10 313628 AW419069 Hs.209670 ESTs -177^ 0.2 

334616 predicted exon -936.6 0.2 

308820 AI821267 Hs.207243 EST -72 0.2 

320416 AI026984 Hs.293662 ESTs -18.4 0.2 

335211 predicted exon -142 0.2 

15 323629 AA375957 Hs.6682 ESTs -100 01 

331420 AW452904 gb:UI-H.BI3^lyh-11-0-Ul.s1 NCI CGAP Su 83 0.1 

315984 AI015862 Hs.131793 ESTs ~ -250.6 0,1 

332833 predicted exon -374.2 01 

332607 NM.002314HS.36566 UM domain kinase 1 -27.6 o"l 

20 313467 AA0O4879 Hs.187820 ESTs -288 2 o'l 

323333 AV651680 Hs.208558 ESTs -735.6 0*1 

330775 AW247020 Hs.250747 SUM01 activating enzyme sutHJnit 1 53.6 0.1 

333168 predicted exon -1041,8 01 

332079 AI308876 Hs.103649 ESTs 19.4 01 

25 322724 AF161442 Hs.191591 Homo sapiens HSPC324 mRNA, partial cds -123.6 OA 

303652 AI799111 Hs.64341 ESTs -46.4 01 

303131 AW081061 Hs.103180 DC2 protein -156.4 01 

320716 AI479439 Hs.171532 ESTs -146.6 01 

300454 AA659037 Hs.163780 ESTs -304 01 

30 312757 AI285970 Hs.183817 ESTs -445 o'l 

312391 R43707 Hs.133159 ESTs. Weakly similar to PIHUSD salivary -111.8 0.1 

308877 AI832519 gb:at69h03.x1 Barstead cotan HPLRB7 Homo -149.6 0 

311275 AI659166 Hs.207144 ESTs -62.6 0 

302363 AW163799 Hs.198365 2,3-bisphosphogtyceratemutase -15 0 

35 321717 AW956580 Hs.42699 ESTs -1059 6 0 

302638 AA463798 Hs.102696 MCT-1 protein -332.2 0 

306352 AA961357 gb:or52a05.s1 NCLCGAP.GC3 Homo sapiens 21.8 

313798 AI292148 Hs.71622 SWI/SNF related, matrix associated, acti -97.2 0 

320807 AA135370 Hs.188536 Homo sapiens cDNA:FU21 635 fis.ctoneC -2222 0 

40 320931 AW262836 Hs.252844 ESTs ^81 6 0 

332450 AW288085 Hs.1 1156 hypothetical protein 28.4 0 

332535 AF167706 Hs.19280 cysteine-rich motor neuron 1 -722 0 

335990 predicted exon -421 0 
330746 AB033888 Hs.8619 SRY (sex detennining region Y)-box 18 35.4 0 

45 316820 AI627912 Hs.130783 Forssman synthetase -373.6 0 

337429 predicted exon -257 0 

331192 BE622021 Hs.152571 ESTs. Highly similar to IGF-II mRNA-bind -33 0 

330609 AI346201 Hs.76118 ubiquiUn cartioxyl-tenninai esterase LI -280 0 

323593 Ar739435 Hs.39168 ESTs -3627.6 0 

50 302704 AA531133 Hs.4253 hypottietfcal prot^ MGC2574 -278.6 0 

330534 Nn^004579Hs.82979 mitogen-adivating protein kinase kinase -244 0 

332374 X91195 Hs.100623 phosphol9)ase C. beta 3. neighbor pseudo -1204.2 0 

333221 predicted exon -189.6 0 

335988 predicted exon -122.6 0 

55 330574 AI984144 Hs.66713 hepatitis delta anfigen-interacting prot -2257.4 0 

312052 BE621697 Hs.14317 nucleolar protein family A, member 3 (H/ -359.2 0 

319568 AF131781 Hs.84753 hypothetical protein FU 12442 * -874.6 0 

337113 predicted exon -24.6 0 

335149 predk:tedexon -191.8 0 
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TABLE 6 A 



Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. The pkeys in Table 7 lacking unlgenelD's are repiesented within 
Tables 1-6A. For each probeset we have listed the gene duster number from which the oligonudeofides were designed. Gene dusteis were oompfled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist. Oakland California). The Genbank aooesskxi numbers for sequences comprising each duster are Osted In the *Acoessk)n' 
column. 



Pkey: Unique Eos probeset Mentifier number 

CAT number Gene dusternumber 
Aooesskm: Genbank accession numbers 



Pkey CAT Number Accession 

320925 1525201J D62892 D79755 D62760 
321614 87866J H86161 AA054308 AA018955 
313952 136885_1 F20956AA129374 AA133740AW819878 
314648 293660_1 AW979268 AA878419 AA431342 AA431628 
302749 458.107 I\fl16951 M16952 M16948 M16949 M16950 
312362 764066 1 AW015994 R39898 AW000978 AI598202 AI521706 
312542 1522649 1 D60076 D60259 D61037 
312642 1005225 1 AW052128 H51439 hl51481 

312986 171879J AA211586 F35799AA211641 F29720AW937387 AW937408 

329350 c_x_hs 
329414 c_^.hs 
329440 c_y_hs 
329451 c_y.Jis 

338033 CH22 6528FG_UNK^EI\fl:AC00 
338038 CH2^.6535F6__UNK.EM:AC00 
338116 CH22J650FG_LINieEM:AC00 
338158 CH25L6700FG_LINK-EIWAC00 
329732 c14j)2 
329745 c14j)2 
308106 AI476803 
329863 c14_p2 

338316 CH22^6944FG_LINK_EM:AC0O 
308248 A1560919 

338388 CH22«7034FG_UNK^EM:AGOO 
338442 CH22.71O9FG_LINK^EM:AC0O 
338645 CH22 7410FG_LINK^EM:AC00 
338728 CH2a^7527FG_LlNieEM:AC00 
308877 AI832519 

338962 CH22_7838FG_LINfeDJ32l10 
308886 Ai833240 

333120 CH22_349FG_81.3.UNK^EMA 

333121 CH22„350FG_81.4.UNK.EM:A 

333122 CH22_351FG 81.6 LINK_EMA 

333123 CH22„352FG.81_7_LINK-EM:A 

333168 CH22^400FG_94_1.UNK.El\4:A 

333169 CH22 401FG.94.2.LINK_EM:A 
333221 CH2a_458FGJ05J_LINK-EM: 
326077 c17_hs 

326080 c17Jis 
326169 c17_hs 
326198 c17^hs 
326230 c17_hs 

333585 CH22_846FG.203.4_LINK.EM: 
333610 CH2^.871FG.217_5_UNi^EM: 
335093 CH2^2423FG^49?_3.LINieEIW 
335095 CH2^.2425FG.492.5.UNK.EI\fl 
335149 CH22L2484FG_499.5_UNK-EM 
326759 C20_hs 

333977 CH22_1254F6_309.6.LINK_EM 
32678B .c20.hs 

335211 CH22«2550FG.511_2_UNK-EIVI 

305192 AAG66019 

303973 AW512014 

303992 AW515800 

326946 C21_hs 

328229 CL6.hs 

328262 C.6 Jis 
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328418 c.7_hs 

328455 c.7^hs 

335697 CH22.305BFG_596J2_LINK-E 

328520 c.7.hs 

5 328548 c.7.hs 

335815 CH22_3187FG.618.3_LINieEM 

328688 c_7_hs 

328695 c_7_hs 

307010 AI140014 

10 337113 CH23-5058FG.493J. 

307041 AI144243 

328700 c.7_hs 

335946 CH22_3324FG.646^20_LINK D 

335986 CH22^3366FG_654_10 LINK_D 

15 335987 CH22_3367FG_654J1_UNK^D 

335988 CH22^3368FG 654 12.UNK.D 

335989 CH2^3369FG_655.2_UNK«DJ 

335990 CH2^^3370FG_655_4_LINK.DJ 
337214 CH22^5288FG_613_7. 

20 330020 c16j)2 

305989 AA888220 

328857 c_7^hs 

328937 c_8_hs 

328957 c^^hs 

25 330187 c_4j32 

337407 CH22„5607FG_755 1_ 

337429 CH2^5633FG_762.3_ 

330232 CL5_p2 

307414 AI242106 

30 330305 c.7j)2 

330306 (L7_p2 

337603 CH2:L5896FG_LINK_C20H12. 

337953 CH22^6395FG_LINK-EMAC00 

339236 CH22_8181FG_LINK.BA354I1 

35 339403 CH22_8384FG_UNK-BA232E1 

309349 AW051913 

325222 c10 hs 

325251 dO^hs 

318188 956161.1 AI792566 AI053836AI054127 AI792489 AI288324 

40 309871 AW300366 

325544 c12_hs 

309931 AW341683 

332833 CH22^50FG_17_7.UNK_C20H1 

302779 33837_1 AJ235667 AJ235666 AJ235664 AJ235665 AJ235668 AJ235669 AJ235670 

45 302790 34168J AJ245245 AJ245247 AJ245257 AJ245248 AJ245254 AJ245256 AJ245253 AJ245203 AJ245250 AJ245252 AJ245243 AJ245204 
AJ245201 AJ245206 AJ245246 AJ245255 AJ245205 AJ245202 AJ245251 AJ245249 AJ245207 AJ245244 

332961 CH22_185FG_48J8_LINK.EM: 

325753 c14 hs 

327036 c21_hs 

50 325843 c16_hs 

325889 c16_hs 

304261 AA059387 

304275 AA070605 

334376 CH22_1670FG_379 8 LINK.EM 

55 327220 c.l^hs 

304363 AA206045 

334458 CH22_1757FG_391_2_LINK^EM 

327365 c_1^hs 

327373 c_2_hs 

60 334616 CH22_1923FG_411J5 LINK E 

327414 c_2.hs 

327568 0_hs 

336034 CH22_3419FG^678_5 LINK.DJ 

336059 CH22_3445FG.684_2_L!NK_DJ 

65 334834 CH22.2148FG^439_3_LINleE^4 

304782 AA582081 

304876 AA595765 

327747 c_5_hs 

336228 CH22_3626FG.730_4 LINK^DA 

70 329073 c^hs 

329088 cjUns 

304969 AA614406 

327844 c.5_hs 

„^ 327876 c_6Jis 

75 306352 AA981387 

331131 genbank.R54797 R54797 
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331139 g«ibank^R65706 R65706 

331420 675983 1 AW452904 AW449414 BE467906 AI298565 BE549932 BE326357 F04362 
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TABLE 6B 

Table 6B shows the genomn: positioning for those pkeys lacking unigene ID'S and accession numtters in Table 6. The pkeys in Tat}ie 7 lacking 
5 unigenelD*s are represented within Tables 1-6B. For each predicted exon. we have listed the genomic sequence source used for prediction. Nucleotide 
locations of each predicted exon are also listed. 



10 



15 



Pkey: 
Ref: 



Strand: 
NLposltion: 



Unique number conesponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifter (Gi) numbers. "Dunham I. et al." refers to the pubRcatton 
entitled The DMA 

sequence of human chromosome 22." Dunham I. et al. Nature (1999) 402:489-495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Pkey Ref 
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332961 

333221 

333585 

333610 

334376 

334458 

334616 

335149 

335211 

335697 

335986 

335987 

335988 

335989 

335990 

336034 

337953 

338033 

338038 

338316 

338442 

338962 

332633 

333120 

333121 

333122 

333123 

333168 

333169 

333977 

334834 

335093 

335095 

335815 

335946 

336059 

336228 

337113 

337214 

337407 

337429 

337603 

338116 

338158 

338388 

338645 

338728 

339236 

339403 

325222 

325251 

325544 

325753 

329745 

329732 

329863 

325889 



Dunham. 1. elal. 
Dunham. 1. elal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. elal. 
Dunham, I. elal. 
Dunham, i. elal. 
Dunham, I. elal. 
Dunham, I. elal. 
Dunham, I. elal. 
Dunham. I. elal. 
Dunham. I. elal 
Dunham, 1. elal 
Dunham, 1. elal. 
Dunham, I. elal 
Dunham. I. elal 
Dunham, I elal. 
Dunham, 1. elal. 
Dunham. I. elal. 
Dunham. I elal. 
Dunham. I elal 
Dunham. I. elal 
Dunham, I. elal 
Dunham, I. elal 
Dunham, I. elal. 
Dunham, I elal. 
Dunham. I. elal. 
Dunham. 1. elal 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham. I. elal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I elal 
Dunham. I. elal 
Dunham, I elal 
Dunham. I elal 
Dunham. I elal 
Dunham. I elal. 
Dunham. I elal 
Dunham. I elal 
Dunham, I elal. 
Dunham, I elal 
Dunham, I elal 
Dunham. I. elal 
Dunham, I. elal 
Dunham. I. elal. 
Dunham. I elal 
Dunham. I. etal. 
6525287 
6682448 
6682452 
6682474 
6065779 
6065783 
6691797 
5867087 



Strand 


Nt_position 


Plus 


2521424-2521555 


Plus 


3978070-3978187 


Plus 


6234778-6234894 


Plus 


6547007-6547116 


Plus 


13902218-13902331 


Plus 


14353496-14353572 


Plus 


15176123-15176470 


Plus 


21497441-21497587 


Plus 


0\A^^A^AA 

21774611-21774680 


Plus 


25481456-25481 649 


Plus 


27967791-27967852 


Phjs 


27971413-27971481 


Plus 


2797791 2-2797801 3 


Plus 


27983788-27983860 


Pile 


27988532-27988608 


Plus 


29014404-29014590 


Plus 


6827029-6827125 


Plus 


8092128-8092271 


Plus 


8138219-8138392 


Plus 


17089711-17089988 


Plus 


19980640-19980698 


Plus 


2958 1 892-29582020 


Minus 


1119848-1119705 


Minus 


3307508-3307427 


Minus 


3308446-3308358 


Minus 


3309596-3309531 


Minus 


3310817-3310749 


Mofius 


3729886-3729788 


Minus 


3730864-3730767 


Minus 


8722928-8722725 


Minus 


17182681-17182535 


Minus 




Minus 


21292546-21292381 


Minus 


26320518-26320421 


Minus 


27487203-27487035 


Minus 


29184079-29183969 


Minus 


30904602-30904497 


Minus 


21233344-21233237 


Minus 


26095902-26095502 


Minus 


31886652-31886567 


Minus 


32086238-32086079 


Minus 


1299296-1299194 


Minus 


10614071-10613814 


Minus 


11794465-11794343 


Minus 


18662403-18662305 


Minus 


24063839-24053775 


I\4inus 


25949039-25948927 


Minus 


32773355-32773202 


Minus 


34050728-34050625 


Minus 


22332-22473 


Minus 


411693-411751 


Phis 


171228-171286 


Plus 


398512-398621 


Plus 


174774-175142 


Plus 


161252-161322 


Plus 


196801-19G971 


Phis 


223829-223891 



175 



3NSDOCID: <WO O2079492A2 I > 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



325843 

330020 

326198 

326230 

326169 

326077 

326080 

326759 

326788 

326946 

327036 

327220 

327365 

327414 

327373 

327568 

330187 

327747 

327844 

330232 

328229 

327876 

328262 

328688 

328700 

328695 

328418 

328455 

328520 

328548 

328857 

330305 

330306 

328937 

328957 

329073 

329088 

329350 

329414 

329440 

329451 



6552453 
6671887 
5867215 
5867230 
5867255 
6682495 



6249610 
6682503 
6004446 
6531965 
5867525 
6552412 
5867750 
5867792 
5867811 
6706138 
5867947 
6249582 
6013526 
5868105 
5868140 
6381906 
5868262 
5866264 
5868264 
5868409 
5868431 
5868477 
5868487 
6381927 
4877982 
4877982 
5868500 
6456773 



5868608 
6456785 
5868874 
5866885 
5868887 



Minus 

Plus 

Minus 

Minus 

Minus 

Minus 

Ptus 

Plus 

Plus 

Minus 

Plus 

Minus 

Minus 

Plus 

Minus 

Minus 

Plus 

Plus 

Minus 

Plus 

Minus 

Plus 

Plus 

Plus 

Plus 

Plus 

Minus 

Plus 

Plus 

Plus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Plus 

Plus 

Plus 

Plus 

Phis' 



7126-7232 

172397-172491 

80295^0674 

301868-301972 

128321-128388 

312108^12168 

478644-478847 

97216^7311 

277132-277335 

116677-116967 

319951-320040 

65701-65781 

118133-118198 

102461-102586 

8186-8742 

46152-46287 

212923-213020 

115322-115498 

18895-18958 

113655-113830 

120935-121053 

103882-104034 

11867-12027 

626030-626094 

764089-764203 

318632-318695 

258811-258894 

385576^85633 

1942075-1942246 

72301-72397 

80557-81051 

52269^2365 

98161-96233 

1448241-1448333 

219195-219297 

37838^7956 

116738-116950 

98911-98969 

942555-942643 

21943-22063 

25974-26048 
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TABLE 7: 



Table 7 defMds Seq 10 No.. UnigenelD. UnigeneTitle. Pkey. and ExAccn for all of the sequences in Table 8. Seq I 
sequence infonnation inTabte 8 to Table 7. 



I No links the nudeu: add and protein 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Pkey: 
ExAocn: 
UnigenelD: 
Unigene Title: 
Seq.ID.No.: 



Unique Eos probeset identifier nunober 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

Sequence Identirication Number found in Table 8 



Unigene Tiltle 



PKey 


^ V A /^#^n 
CAMwVII 


unigcfic lu 


101545 




He 1(vd9in 


115819 




no.H 1 I03 


424503 


MM nn?9n*; 


no. l*r9UU9 


102917 


AiniR715 


Ue 9ft77Q7 




AUf OZU 








Ue 09i0n 


iU/oOO 








AAoUlobr 




A MM A 




Ue 'TCOOil 






Ue O'i'lRTQ 

ris^iio/9 


lUOUJD 




Ue» o^ieo 


iU*tODD 








UQooee 


Ue 7i;fS7 






ns./ ^no 


'iCiAJPA 




Lie 97A(;0i; 


1 oozuu 


MDUo/ f 10 








Uc PRfi9 

nS.OOoZ 


1 n9RQ9 




Ue RT19R 

ns.ooozo 




AXAjQCCRQn 

MWyOODOU 


Ue ^OAQQ 


1 monR 


A AMtJOII 


Ue 17^0/1 


1 10071 




Uo 97Qj1'77 


1 "konfin 

I O^UDU 


AI^Df 010 




1 10/ion 


KIM nn^OQn 


u«> vtofln 




MVV iDlOO^ 


Uc> D'S'aQ'l 

nS.Ooool 




lvlDOO/4 


l_J_ O^l 'tCDT 








1/V*7QQ 

lUOf 9<3 


UQAQ07 


nS.iD4oU 


1 lOO 1 1 




Ue A'X'im 


iniAd7 


fVI£ IOU3 






I99UZ 


Ue iinRcn 




MDUlO&!l9 


Ue ilfil^ 

r15.1lOl4U 


0 1 J9 10 


AVVZ90QuU 


Ue ^A^iiAn 

ns.i4iow 


«3 1 J39D 


MMO^Oo 1 / 




O 1 JD09 


MVV/9l£UI 


Ue CI 999 

nS^l£<M 




A| rMni7fl 

MLUmU I/O 


nS.14^UUo 


A0Q07R. 


MrUODUOP 


lis. 1 9001 z 


lU lo'tO 




nS.lo^ I/O 


418994 


AA29B520 

r\#^*.W ww^W 


Ue fiQMR 


103850 


AA187101 


Hs.213194 


133260 


AA403045 


Hs.6906 


101097 


BE245301 


Hs.89414 


104786 


M027167 


Hs,10031 


132173 


X89426 


Hs.41716 


100420 


D86983 


Hs.1 18893 


111018 


AI287912 


Hs.3628 


108507 


AI554545 


Hs.68301 


104894 


AF065214 


Hs.18858 


118511 


N75620 


Hs.43157 


125509 


AA868063 


Hs.104576 


101543 


M31166 


Hs.2050 


102241 


NM.007351 


Hs^68107 


101560 


AW958272 


Hs.347326 


103280 


U84722 


Hs.76206 


105826 


AA478756 


Hs.194477 


102804 


NM-002318 


Hs.83354 


131647 


AA359615 


Hs.30089 


103095 


NM_005424 


Hs.78824 


103037 


BE018302 


Hs.2894 


100405 


AW291587 


Hs.82733 


102012 


BE259035 


Hs.118400 



endothelial difTerentiation. sptiingolipl 
endomucin-2 

integiin, alplia 5 (fibronectin receptor, 
integrin. beta 1 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
ESTs 

podocalyxin-fike 

EGFHXmtalning fibulin-like extracellula 
EGF-containing fibulln-IIke extracellula 
melanoma cell adhesion molecule 
matrix metalloproteinase 1 (Interstitial 
Bk»I1 CLUlymphoma 6, member B (zinc fi 
Homo sapiens cDNA: FIJ21962 fis, done H 
hypothetical protein FLJ20992 similar to 
ESTs 

hypothetical protein F1J10210 

solute carrier family 7. (cationic amino 

matrix metaUoproteinase 3 (stromelysin 

ESTs 

ESTs 

ESTs 

ESTs 

LIM domain binding 2 

guanine nucleotide binding protein 11 

phosphoiipase A2, group IVA (cytosolic, 

tissue factor pathway inhibitor 2 

ESTs 

ESTs 

gb:Human alpha sateDIte and satellite 3 
ESTs 

K1AA0716 gene product 

ESTs. Weakly similar to S59501 Interfero 

ESTs 

ESTs 

ESTs 

G protein-coupled receptor 51 

calcitonin receptor-like 

selectin E (endothelial adhesion molecul 

hypothetical protein MGC10895 

Homo sapiens cDNA: FLJ23197 fis. done R 

chemokine (C-X-C motif), receptor 4 (fus 

KIAA0955 protein 

endotf^elal celV-spedfic molecule 1 

Melanoma assodated gene 

mitogennactlvated protein kinase kinase 

ESTs 

phosphoiipase A2, group IVC (cytosolic 
ESTs 

carbohydrate (keratan sulfate Gal-6) sul 
pentaxin-retated gene, rapitit^ induced b 
multimerin 

intercellular adhesion molecule 2 

cadherin 5. type 2, VE-cadherin (vascula 

E3ubkiuitiniigaseSMURF2 

lysyloxidase-like2 

ESTs 

tyrosine kinase with bnmunoglobulin and 
placental growth factor, vascular endoth 
nidogen 2 

singed (Drosophita}-like (sea urchki fas 



SEQ ID NO 

Seq ID1&2 
Seq ID3&4 
Seq ID 5 & 6 
Seq ID7&8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17&18 
Seq ID 19 & 20 
Seq ID 21 &22 
Seq iD23&24 
SeqlD25&26 
Seq ID27&28 
Seq ID29&30 
Seq ID 31&32 
Seq ID33&34 
Seq ID 35 & 36 
Seq ID37&38 
Seq ID 39 & 40 
Seq ID 41 & 42 
Seq ID 43&44 
Seq ID 45&46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51&52 
Seq ID 53 & 54 
Seq ID 54 & 55 
Seq ID58&57 
Seq tD58&59 
Seq ID 60 & 61 
Seq ID 62 & 63 
Seq 1064 &65 
SeqI066&67 
Seql068&^9 
Seq ID70&71 
SeqlD72&73 
Seq 1074 &75 
SeqlD76&77 
Seq ID 78 & 79 
Seq ID 80 & 81 
Seq ID82&83 
Seq ID84&85 
Seq 1D86&87 
SeqD88&89 
Seq ID90&91 
Seq ID 92&93 
Seq ID 94 895 
Seq ID 96&97 
Seq ID 98 & 99 
Seq ID 100 & 101 
Seq ID 102 & 103 
Seq ID 104 & 105 
Seq ID 106 & 107 
Seq ID 108 & 109 
Seq ID 110 & 111 
Seq 10 112 & 113 
Seq 10 114 & lis 
Seq 10 116 & 117 
Seq to 118 & 119 
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101261 


D30857 


Hs.82353 


protein C receptorf endothfi&ai (EPCR) 


SeqID120&121 


105729 


H46612 


Hs^93815 


Homo saotens HSPC285 iriRNA. oartial cds 


SeqID122&123 


in79ifi 

Ivr £ lO 


u9 IU09 


H&21157d 


melanofna cell adh^ion mntenilft 


Sea ID 124 & 125 


131080 


NM 001955 


Hs.2271 


endothelin 1 


Seq ID 126 & 127 


1114Rfi 




Ms97372 


BMX non-fecenhv tvindne kinase 


Seq ID 128 & 129 




AW580939 


Hs.97199 


comDlement fscunDonfint Clo reoBDtor 


Seq ID 130 & 131 




028235 


Hs.1 98384 


Drostaalandin-endoneiDxide synthase 2 (0 


SeqlD132&133 


115827 


AA428000 


H5^83072 


acHn related Dratein 2/3 oomolex. sutiun 


Seq ID 134&135 


133614 


MM 003003 


Hs.75232 


SEC14fS cerevisiae)-Gke 1 


Seq ID 136 & 137 




AI346201 


Hs76118 


ubkiiiHin carfooxvUtBimtnal estefase LI 


Seq ID 138 & 139 


132546 


M24283 


Hs 168383 


intercenular adhesion moleoilo 1 fCD54) 


Seq ID 140 & 141 


lOOD/O 


AIAJ0479C0 


NA 


iiuucuoiuc pi ii/op4 im jiodc 


Sea ID 142 & 143 


1*^01 Ad 


nsojuo 


Hs 15165 


rpHnntr siHH inriiirpd 14 


Seq ID 144 & 145 


1 0*T f GO 


T9Qfi1ft 

1 Z90 10 


Hs 89640 


TPI^ hrrneinp kinaep pnrinlhpKal fifpnnil<t 


Sea ID 146 & 147 


liC90r 1 


ynAA9fl 

AUwOZO 


Hs 110809 


vnn WIllphranH fstdnr 


Seq ID 148 & 149 




AAnAA9dft 
MMUCrttHO 


Ue A5Q3Q 
no.09999 




Sea ID ISO & 151 






Hs 188746 


ESTs 


Seq ID 152 & 153 


'^191 7^ 


AIA91dnQ 


Hs 304471 


EST 


Seq ID 154 8i 155 


O 1 9 r 99 


MDUJf OZ 1 


Hs 146858 




Seq ID 156 8i 157 


'^13Q7ft 


Alfi7ni7*i 


Hs 13957 


ESTs 


Seq ID 158 & 159 




A 1077477 

Mil// i HI t 


Hs 307912 


ESTs 


Seq ID 160 & 161 






Hs 148939 




Sea ID 162 & 163 


310272 


AF2163a9 


Hs 148932 


SGma domain, transmsmbrane domain (TM), 


Seq ID 164 & 165 


315044 


RP^7fi74 


Hs 204169 


ESTs Wp;)klv c;ifnil;)r to S65657 atnha-l C- 


Seq ID 166 & 167 


^91 -191; 


ADftrJinn 


Hs 300646 


KIAA1974 nmtpin feimikirto mouse natad 


Sea ID 168 & 169 




ABfi'^'^inn 


Hs 30D646 


KIAA1974 nmtpin fsimikirio mouse naiad 


Sea ID 170 & 171 




AF94flfi'^S 
r\r£*fuo»3»? 


Hs 115897 


nmtopfiHhprin 19 


Seq ID 172 & 173 




Ai 100719 
ML1U9/ lit 




noiiiD sapiens ii1imw\ luii icnyui uiaeii Mun 


Sea ID 174 & 175 




P11An9 


He RA1A 
nb.DO 10 




Sen ID 176 & 177 


^inA^9 




Me 9nAA70 


CO IS 


Sea ID 178 & 179 

OC^ lU 1 f O 1* 1*9 


OU(/409 






hunnfhofir«ol nmtpin H 190401 


Sea ID IfiO £ 181 




IAfA7A7A 
WOf Of •! 


He 9i;977 


%At\mf\ eanlone /4^K1A PI 110717 fie* Hnnp MT 


Sea ID 189 & 183 


00U9DO 


r<*RDDf 


He 9^74A 


CO IS 


Sea ID 184 & 185 

OCIf Iv lot Oi lOU 






He 971Qflft 


infpnrin oinha 9 lf^T\A(itX ainhs 9 Cllhtint 
ifitcy lui, aipna ^ \wUH9D, atpna c ouuuiii 


Sea ID 186 & 187 


499C7'a 


AW9Q7QA'i 

nVV£9f 90w 


He 205726 


intpfirin ainha V /i/itmnprfin rpcpnfor 


Seq ID 188 & 189 


lOOUDl 




He 9QAfv9ft 


piosiaie uineicnuduoi 1 lavtui 


Sea ID 190 & 191 




AW9Qfl944 


Hs 966195 
no.zoo 1 99 


CO 1 o 


Sea ID 192 & 193 




DCZ*t/ £99 


He 78452 
no. f o*t9^ 


entiitp /^arripr fAmlh/ 90 /nhnenlutp tran 


Seq ID 194 & 195 


1 10 199 




He 8881 


PSTq Wp;iklv similar fn S41044 chromosom 


Seq ID 196 & 197 


lU 1 f **1 


MM 00*^100 


He 396198 
no.o^u 1 90 


Uallounpuuil laolUl *r 


Sea ID 198 & 199 


'^91011 

OZ 1 9 1 1 


Apftocq^A 
r>rUtD9*»*t 


Hs 9937Q7 
no.£9<jf 9/ 


CO 1 9 


Seq ID 200 & 201 


ooneoc 


tMOUDu 


He AOfiOR 


oiriaii nucieai nDunucicupiUisui (juiypcpi 


Sea ID 202 & 203 








MM 017ft^^*HrkmA canione hunnthoiirat nml 


Sea ID 204 & 205 


1^9QRA 
IJZ9OO 




He 61R3A 


iiiyuoin A 


Sea ID 206 & 207 


1 ^i;n7i 

lODU/ J 


VAfCCQCC 


He 040^)0 


Hnmn cantene mRMA- pTIMA nKP7n5fi6F1R94 ff 


Sea ID 208 & 209 


1U090/ 


ALUOUlUf 




[ronscnpuonai oc/*aCuvaiur wiui rutMii 


Sea ID 210 A 211 


116430 


AK001531 


Hs.66048 


hypothetical protein FU10669 


Seq ID 212 & 213 


104877 


AI138635 


Hs.22988 


Homo sapiens clone IMAGE:451939. mRNA se 


Seq ID 214 & 215 


122697 


AA420683 


Hs.98321 


hypothetical protein FU14103 


Seq ID 216 & 217 


112522 


R68857 


88.265499 


ESTs 


Seq ID 218 & 219 


304782 


AA582081 




gb:nn32h08.s1 NCLCGAP.GasI Homo sapiens 


SeqtD220 &221 


312802 


AA644669 


Hs.193042 


ESTs 


Seq ID 222 8i 223 


302680 


AW192334 


H5.38218 


ESTs 


SeqlD224 &225 


326198 






Phase 2 & 3 Exons 


Seqro226 &227 


331019 


NM.006033 


Hs.65370 


lipase; endothefial 


SeqlD228 &229 
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TABLES 



Seq ID NO: 1 DNA seauenee 
Nucleic Acid Accession #: 
Coding sequence: 244- 



NM_001400 

2208 (\mderlined sequences correspond to start and stop codons) ) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



GTCGGGGGCA 
CTTCGCCCTG 
CACAAAAAGC 
CGCCCTCTAG 
AC CATGG GGC 
GTCAACTATG 
GACAAGGAGA 
ATCCTG6AGA 
ATGTACTATT 
GCTAACCTGC 
CGGGAA6GGA 
ATTGAGCGCT 
CTCTTCCTGC 
ATGGGCTGGA 
TU^GCACTATA 
CTGTACTGCA 
AACATTTCCA 
ATCGTCCTGA 
GTGGGCTGCA 
GCTGTGCTCA 
CXSGGCCTTCA 
TTCAAGCGAC 
CACCCCCAGA 
TCTTCTTCCT 
CCACCCCAGT 
CAAGCCAGAG 
TAGAGTTAGT 
TATATATTCT 
AGCTCCTAAA 
TCTTTGTCTG 
GTGTGCACTT 
TTCATACCCC 
CTGGGGTTGT 
TGGGAAGATG 
CATGTAAGCG 
CATCTTTTCA 
AAGCCCACTT 
AGCAAAACAA 
AAATGAGTCT 
TCTTGTGTGA 
CTTGATTTTT 
GTTAACTTTT 
CGCCAGAACT 
ACAAAGAATA 
AGATGTCTTQ 
TTTGCACATA 



11 
I 

GCAGCAAGAT 
CTTGAGCGAG 
CTGGATCACT 
CGTTCGTCTG 
CCACCAGCGT 
ATATCATCGT 
ACAGCATTAA 
ACATCTTTGT 
TTATTGGCAA 
TCTTGTCTGG 
GTATGTTTGT 
ATATCACAAT 
TAATCAGCGC 
ACTGCATCAG 
TCCTCTTCTG 
GAATCTACTC 
AGGCCAGCCG 
GCGTCTTCAT 
AGGTGAAGAC 
ACTCCGGCAC 
TCCGGATCAT 
CCATCATCGC 
AAGACGAAGG 
AGAACTGGAA 
GTTT6GAAAA 
GGAGGAAG6G 
TCCTGTGAAC 
ACCCCCCTGG 
GGGTTCATTT 
GAGCTTTGA6 
CTGCTTCTTT 
TCCTCAACGT 
GGAATGATCG 
AAGATGGTTT 
GGATCCGTTT 
ATGAAATGTG 
TATCTAAATG 
AGTGAAAACC 
AACAAATATG 
TTCATTTCAA 
GAATGTATTT 
CTAGAATCCA 
TTTAAGTCCA 
AAAATATATT 
TTTTTTTAAA 
GCTTTATCAA 



21 

I 

GCGAAGCGAG 
GCTGCGGTTT 
CATCGAACCA 
GAGTAGCGCC 
CCCGCTGGTC 
CC6GCATTAC 
ACTGACCTCG 
CTTGCTGACC 
TCTGGCCCTC 
6GCCACCACC 
GGCCCTGTCA 
6CTGAAAATG 
CT6CTGGGTC 
TGCGCTGTCC 
CACCACGGTC 
CTTGGTCAGG 
CAGCTCTGAG 
CGCCTGCTGG 
CTGTGACATC 
CAACCCCATC 
GTCCTGCTGC 
CGGCATGGAA 
GGACAACCCA 
GCTGTCCACC 
AAATCTCTGG 
G6AGAATAC6 
AAT6CACTGG 
AGCTTTGATT 
GGCCCCTCCT 
GAGAT6TTTT 
AGGGATGCCC 
TCTTTTACTT 
ATCATCTATA 
GGAGGTGTAA 
TTTGGAATTT 
TTACCATTTC 
ATATTAGCCA 
GAATGGATTA 
ACATCCGTCT 
GCAACAACAT 
GTTTCAGGAA 
CCCTCTTGTG 
GCTATTCATT 
ACTGTCTCTT 
AAGAATAGTA 
CTTTTAAACA 



31 

I 

CCGTACAGAT 
CCGAGGCCCT 
CCCCTGAAGC 
ACCCCGGCTT 
AAGGCCCACC 
AACTACACGG 
GTGGTGTTCA 
ATTTGGAAAA 
TCAGACCTGT 
TACAAGCTCA 
GCCTCCGTGT 
AAACTCCACA 
ATCTCCCTCA 
AGCTGCTCCA 
TTCACTCTGC 
ACTCGGAGCC 
AAGTCGCTGG 
GCACCGCTCT 
CTCTTCAGAG 
ATTTACACTC 
AAGTGCCCGA 
TTCAGCCGCA 
GAGACCATTA 
CACCGGAAGC 
6CTTCGACTG 
AACAGCCTGG 
GAAGGGT6GA 
TTGCACTGAG 
CAAAGACTAA 
CCTTCACTTT 
TGTACATCCC 
TATACTTTAA 
GCAAATAGGC 
AACAATGTCC 
GGTTGAAGTC 
ATATCCATTG 
GGATCCTTGG 
ACTTTTGCAA 
TTCCCACTTT 
GTTGTATTTT 
GAAGTCATTT 
CCCTTAAGCA 
AGATAGTAAT 
TAGTATGGTT 
TTTAATA6GT 
TTAATAAACT 



41 
I 

CCCGGGCTCT 
CTCCAGCCAA 
CAGTGAAGGC 
CCTGGGGACA 
GCAGCTCGGT 
GAAAGCTGAA 
TTCTCATCTG 
CCAAGAAATT 
TGGCAGGAGT 
CTCCCGCCCA 
TCAGTCTCCT 
ACGGGAGCAA 
TCCTGGGTGG 
CCGTGCTGCX: 
TTCTGCTCTC 
GCCGCCTGAC 
CGCTGCTCAA 
TCATCCTGCT 
CGGAGTACTT 
TGACCAACAA 
GCGGAGACTC 
GCAAATCGGA 
TGTCTTCTGG 
GCTCTTTACT 
CTGCCAGGGA 
TGGTGTCGGG 
GATCAGGTCC 
CCAAAGGTCT 
TGTCCCCATG 
AGTTTCAAAC 
ACACCCCACC 
CTACCTGAGA 
TATGTTGAGT 
TTCGCTGAGG 
ACTTTGATTT 
AAGCCGAAAT 
TGTCCTAGGA 
ACCT^GGGAG 
TGTTGATGTT 
GTTGTGTTAA 
TATGGATTTT 
TTACTTTAAC 
TGAA6ATATG 
TTCAGTGCAA 
TTCTGACTTT 
GATTTTTTTA 



51 



CCGAACGCAA 


60 


GGAAAAGCTA 


120 


TCTCTCGCCT 


180 


CAGGGTTGGC 


240 


CTCTGACTAC 


300 


TATCAGCGCG 


360 


CTGCTTTATC 


420 


CCACCGACCC 


480 


AGCCTACACA 


540 


GTGGTTTCTG 


600 


CGCCATCGCC 


660 


TAACTTCCGC 


720 


CCTGCCTATC 


780 


GCTCTACCAC 


840 


CATCGTCATT 


900 


GTTCCGCAAG 


960 


GACCGTAATT 


1020 


CCTGCTGGAT 


1080 


CCTGGTGTTA 


1140 


GGAGATGCGT 


1200 


TGCTGGCAAA 


1260 


CAATTCCTCC 


1320 


AAACGTCAAC 


1380 


TGGTCGCTGG 


1440 


GGAGCTGCTG 


isoo 


TGTTGGTGGG 


1560 


CGGCCTGGAA 


1620 


AGCATTGTCA 


1680 


TGji^AAGCGTC 


1740 


CCAAGT6AGT 


1800 


CTCCCTTCCC 


1860 


GTTATCAGAG 


1920 


ACX3TAGGCTG 


1980 


CCAAAGTTTC 


2040 


CTTTAAAAAA 


2100 


CTGCATAAGG 


2160 


GAAACAGACA 


2220 


ATTTCTTAGC 


2280 


TATTTCAGAA 


2340 


AAGTACTTTT 


2400 


TCTAACCCGT 


2460 


TGGTAGGGAA 


2520 


TATAAATATT 


2580 


TTAAACCGAG 


2640 


TGTG6ATCAT 


2700 


AA6 





60 



65 



70 



75 



Seq ID NO: 2 Protein fie^l^^t> r^nt» * 
Protein Accession #: NP 001391 



MGPTSVPLVK 
LENIFVLLTI 
EGSMFVAIiSA 
GWNCISAIiSS 
ISKASRSSEK 
VUVSGTNPII 
PQKDEGDNPB 



11 
I 

AHRSSVSDYV 
WKTKKFHRPM 
SVFSLIiAIAI 
CSTVliPIiYHK 
SIiAIiliKTVII 
YTIjTNKEMRR 
TIMSSGNVNS 



21 
1 

NYDIIVRHYN 
YYFIGNIiALS 
ERYITMUCMK 
HYILFCTTVF 
VLSVPIACMA 
AFIRIMSCCK 
SS 



31 
I 

YTGKIiNISAD 
DLIAGVAYTA 
LHNGSNNFRL 
TUiLIiSIVIL 
PLFIUjLLDv 
CPSGDSAGKP 



41 
I 

KENSIKLTSV 
NT.LLSGATTY 
FLIilSACWVI 
YCRIYSLVRT 
GCKVKTCDIL 
KRPIIAGMEF 



51 

1 

VFIIjICCFII 
KLTPAQWFIiR 
SLIIiGGLPIM 
RSRRIjTFRKN 
FRAEYFLVIiA 
SRSKSDNSSH 



60 
120 
180 
240 
300 
360 



179 
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Seq ID NO: 3 Nucleotide sequence; 
Nucleic Acid Accession #: NM_016242 

Coding sequence; 79-864 (underlined sequences correspond to start and stop codons) ) 



AAGGCCCTGC 
ACACCATCTA 
ATTTGCAGCA 
ACAACAAAAC 
CCAACAACTG 
ATGTCAACAG 
GTCAGGAAGA 
GCTGTTTCAA 
ACAGAAATAC 
ACCTCAATAC 
GGTGGAAAAA 
GTGGTTATTG 
ATGTGCTGGA 
AAAGAGAGCG 
GCACAAGGAA 
TACGCTTAAT 
AATCCCGACT 



11 
I 

CAGCTTGGGA 
CGGGCACCAT 
GTAACAGCAC 
CATCTATAAC 
GAACAACTCC 
CTACTTTTTT 
ATGACTCCAT 
CATTACAAAG 
CAGGTAGTGT 
CAGTTACAAT 
ATGCAAGCAC 
CTTTGATTGT 
AGGCAGATCC 
TGAAGCTTCT 
AAACCAAGAA 
CTTCAGCTTC 
TCCATACCTG 



21 
I 

GGGAATTGTC 
GGAACTGCTT 

AGGTGTTTTA 
AACACCAAAC 
TAAAGGAACA 
AACAAGTAAA 
CATTTCAAAC 
TTCCAAACCC 
TCTACAACCA 
TCCAGAAAAC 
TTCAGCAACC 
AATAACACTT 
GGGCACACCA 
TACCGTTAAG 
CTGACAGCTT 
TATGCACCAA 
CTGCTGG 



31 
I 

CCTGCXTPGCT 
CAAGTGACCA 
GAGGCAGCTA 
ACAGAATCAT 
ATCACCAATG 
GATGAAGGAT 
GTAACAGTAA 
AAGACTGAAA 
GATGCATCAC 
ACCTCACAGT 
AGCCGGTCTT 
TCAGTATTTG 
GAAAATGGAA 
ACAATTTCTC 
GAGGAATTCT 
GCGTGGAAAA 



41 
I 

TCTGGAGAAA 
TTCTTTTTCT 
ATAATTCACT 
TACA6AAAAA 
AATTACTTAA 
TGAAAGCCAC 
CAAGTGTTAC 
CTCAGAGTTC 
CTTCTAAAAC 
CTCAAGTAAT 
ATTCCAGTAT 
TTCTGGTGGG 
ATGATCAACC 
ATGAGTCTGG 
CTCCACACCT 
6GAGAAAGTC 



51 
I 

GAAGATATTG 
TCTGCCCAGT 
TGTTGTTACT 
T6TT6TCACA 
AATGTCTCTG 
AACCACTGAT 
ACTTCCCAAT 
AATTAAAACA 
TGGTACATTA 
AGACACTGAG 
TATTTTGCCG 
TTTGTACCGA 
TCAGTCTGAT 
TGAGCACTCT 
AGGCAATAAT 
CT6CAGAATC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NO: 4 Protein sequence s 
Protein Accession #: NP_057326 

1 11-21 31 41 51 

I I I I I I 

MELLQVTILF LLPSICSSNS TGVLEAANNS LWTTTKPSI TTPNTESLQK NWTPTTGTT 60 

PKGTITNELL KMSLMSTATF IiTSKDEGIiKA TTTDVRKNDS IISNVTVTSV TLPNAVSTIjQ 120 

SSKPKTETQS SIKTTEIPGS VLQPDASPSK TGTLTSIPVT IPENTSQSQV IDTEGGKNAS 180 

TSATSRSYSS IILPWIALI VITLSVFVLV GLYRMCWKAD P6TPENGNDQ PQSDKESVKL 240 
liTVKTISHES GEHSAQGKTK N 



Seq ID NO: 5 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002205 

Coding sequence: 24.. 3173 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I 1 I I I I 

CAGGACAGGG AAGAGCGGGC GC TATGG GGA GCCGGACGCC AGAGTCCCCT CTCCAC6CCG 60 

TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCC6CTGCTG TTGCTGCTTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTTAGACGC GGAGGCCCCA GCAGTACTTCT 180 

CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 

GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTrCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 54 0 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAG<3G GCAGCTGCA6 ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 840 

GATA<?rCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

TGCCCAAAGG GAACCTCACT TACG6CTATG TCACCATCCT TAATGGCTCA GACS^TTCGAT 960 

CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 

CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 

GCAGCTCCTT GACCCCCCTG G6GGACCTG6 ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 



180 



.02079492A2.L> 
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GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 

CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 

ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 

5 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA 1800 

ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCTCCTTGO 1860 

10 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGT6TGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGQG TGGCGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CX3TCAGACAC CCAGGGAACT 2160 

15 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGO CCTTCGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 234 0 

ACAACTCX5CA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

20 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 252 0 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 264 0 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 

25 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 282 0 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

30 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC TGAGTCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CT6AAAAGGA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

35 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGG6GGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG A6A6TTTCCT GAAACAACTG 34 80 

GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

40 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 378 0 

TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3840 

CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 

AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 

45 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 

CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 

AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 4200 
AAAA 

50 

Seq ID NO: 6 Protein sequence; 
Protein Accession #: OT_002i96.1 

55 1 11 21 31 41 51 

I I I I I I 

MGSRTPBSPL HAVQIJRWGPR RRPPLVPLLL IiLVPPPPRVG GPNLDAEAPA VI*SGPPGSFF 
GFSVEPYRP6 TDGVSVLVGA PKANTSQPGV liQGGAVYLCP WGASPTQCTP lEFDSKGSRL 
IiESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSIIiACAPIiY SWRTEKEPIiS DPVGTCYLST 

60 DNFTRIIiEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQIIiSATQEQ 
lAESYYPEYL INIiVQGQIiQT RQASSIYDDS YIjGYSVAVGE FSGDDTEDFV AGVPKGNIjTY 
GYVTIIiNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLIjVGAPLL MDRTPDGRPQ 
EVGRVYVYliQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPIjG DLDQDGYNDV AIGAPFGGET 
QQGWFVFPG GPGGIiGSKPS QVLQPIjWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 

65 VDKAWYRGR PIVSASASIiT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 
GFTVEIiQLDW QKQKGGVRRA LFIiASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 
I^PIHIALNF SLDPQAPVDS HGIiRPALHYQ SKSRIBDKAQ ILLDCGEDNI CVPDliQIiBVP 
GEQNHVYIiGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGIiVRHP GHPSSIjSCDY 
FAVNQSRliLV CDLGNPMKAG ASLWGGIiRPT VPHIiRDTKKT IQFDFQIIiSK NLNNSQSDW 

70 SFRIiSVEAQA QVTLNGVSKP BAVIiFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 
SQGVLELSCP QAIiEGQQIiLY VTRVTGIiNCT TNHPINPKGL EIiDPEQSLHH QQKRBAPSRS 
SASSGPQILK CPEAECFRLR CELGPLHQQB SQSIiQIiHFRV WAKTFIiQREH QPFSLQCBAV 
YKAIJCMPYRI LPRQLPQKBR QVATAVQWTK AEGSYGVPIiW IIILAILFGl. IiLIXSLIjiyilj 
YKIiGFFKRSL PYGTAMEKAQ LKPPATSDA 

75 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



181 
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Seq ID NO: 7 Nucleotide sequencer 
Nucleic Acid Accession #: NM_0022ll 

Coding sequence: 104.. 2500 (underlined sequences correspond to start and stop codons) 



11 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



GTCCGCCAAA 
ACAAGCCTAA 
TTTCTGGATT 
ATGTTTAAAA 
GTGGTGCACA 
TTTAGAAGCC 
CAAAGATATA 
CAAGCCA6AG 
GGAGCCACAG 
CTACCTTATG 
AACAGATCTG 
ATTTGTGGAA 
TTGCACAAGT 
TAATAAAGGA 
TTCTCCAGAA 
CTGGAGGAAT 
AGATGGGAAA 
TATGTACACA 
GAGTGAAAAT 
GGAGCTGAAA 
TGTAATTCAG 
CGGCAAATTG 
TGGAACAGGG 
TGAAATTAGC 
GCCTCTGGGC 
CCAAAGC6AA 
TGGCGCGTGC 
AGTTAACAGT 
TAACAATGGA 
AATTTATTCT 
AATTTGTGGA 
TGGCAGTGCA 
CTGCAATGGC 
AGGGCAAACG 
TGTTCAGTGC 
CTATTTTAAC 
TCCTGTGTCC 
AGTGAATGGG 
TCCAGACATC 
ATTACTGCTG 
TGAAAAGGAG 
CGTAACAACT 
ACAACACTGA 
TTGCCATGGT 
TGTTTTATTA 
ACAGGATGGT 
GACTATTGAA 
AGTTAAAGTA 
CTTTACATAT 
AAGGATTGTT 
CTGAAAGACA 
AAAGGCCATG 
GTGCCATTTT 
AGCCTTAGAT 
ACATACAGGC 
TTTATTATTT 
TGTATTTGTT 
AGCGGTCAAT 
TCACCTACGC 
CACCATTTGA 
GTTTTAACAG 



ACCTGCGCGG 
CGTCCGCTGG 
GGACTGATCA 
GCAAATGCCA 
AATTCAACAT 
TTAAAAAAGA 
AAGAAAAATA 
GATATTACTC 
ACATTTACAT 
GACCTGTCTT 
AT6AATGAAA 
AAGACTGTGA 
GAACAGAACT 
GAAGTATTTA 
GGTGGTTTCG 
GTTACACGGC 
CTTGGTGGCA 
ATGAGCCATT 
AATATTCAGA 
AACTTGATCC 
TTGATCATTG 
TCAGAAGGAG 
GAAAATGGAA 
ATAACTTCAA 
TTTACGGAGG 
GGCATCCCTG 
AG6TGCAATG 
GAAGACATGG 
GAGTGCGTCT 
GGCAAATTCT 
GGAAATGGTG 
TGTGACTGTT 
CGGGGCATCT 
TGTGAGATGT 
AGAGCCTTCA 
ATTACCAAGG 
CATTGTAAGG 
AACAACGAGG 
ATTCCAATTG 
ATATGGAAGC 
AAAATGAATG 
GTGGTCAATC 
ATGCAAAGTA 
TTTACTCATG 
TTTTGAAAAT 
TATTCTTGTC 
ATCAAGCTTA 
ATGAGCATGA 
GTCAGTTTGC 
TTAAATCTGT 
AGTATGTTGA 
GGAAAAATTC 
AAGAGTTACT 
AAAAGAACCG 
CATACTTTAC 
TTATTTTGTT 
TGCAATTTTG 
TTGCCTTTTT 
GAGTCTTACT 
GTTGCCCATC 
TTCA 



21 

I 

ATAGGGAAGA 
GCCCCGGACG 
GTTCAGTTTG 
AATCATGTGG 
TTTTACAGGA 
AGGGTTGCCC 
AAAATGTAAC 
AGATCCAACC 
TAAAATTCAA 
ATTCAAT6AA 
TGAGGAGGAT 
TGCCTTACAT 
GCACCACCCC 
ATGAACTTGT 
ATGCCATCAT 
TGCTGGTGTT 
TTGTTTTACC 
ATTATGATTA 
CAATTTTTGC 
CTAAGTCAGC 
ATGCATACAA 
TAACAATAAG 
GAAAATGTTC 
ATAAGTGTCC 
AAGTAGAGGT 
AAAGTCCCAA 
AAGGGCGTGT 
ATGCTTACTG 
GCGGACA6TG 
GCGAGTGTGA 
TTTGCAAGTG 
CTTTGGATAC 
GCGAGTGT6G 
GTCAGACCTG 
ATAAAGGAGA 
TAGAAAGTCG 
AGAAGGATGT 
TCATGGTTCA 
TAGCTGGTGT 
TTTTAATGAT 
CCAAATGGGA 
CGAAGTATGA 
GCAATTTCCA 
TGCAGGTTTT 
AATGTTGTAA 
AGCTAA6GTC 
TTGGATTAAG 
TGAGAGTTTC 
AGTTATGCAG 
TATTTTGCTA 
GAGTT6CTGG 
AGAGAGTTAG 
TAATGTTTGG 
AGCAATTTTC 
AAAGTATTTG 
TAATGTCTGG 
GGGTAAGACT 
AATGAACATG 
TTGAGTTAGT 
TTGTTTCACA 



31 
I 

ACAGCACCCC 
CCGCGCGGAA 
CTGTGTGTTT 
AGAATGTATA 
AGGAATGCCT 
TCCAGATGAC 
CAACCGTAGC 
ACAGCAGTTG 
GAGAGCTGAA 
AGACGATTTG 
TACTTCGGAC 
TAGCACAACA 
ATTTA6CTAC 
TGGAAAACAG 
GCAAGTTGCA 
TTCCACAGAT 
AAATGATGGA 
TCCTTCTATT 
AGTTACTGAA 
AGTAGGAACA 
TTCCCTTTCC 
TTACAAATCT 
CAATATTTCC 
AAAAAAGGAT 
TATTCTTCAG 
GTGTCATGAA 
TGGTAGACAT 
CA6GAAAGAA 
TGTTTGTAGG 
TAATTTCAAC 
TCGTGTGTGT 
TAGTACTTGT 
TGTCTGTAAG 
CCTTGGTGTC 
AAAGAAAGAC 
GGACAAATTA 
TGACGACTGT 
TGTTGTGGAG 
GGTTGCTGGA 
AATTCATGAC 
CACGGGTGAA 
GGGAAA ATGA 
TAGTCACAGT 
GAAAATGTAC 
TTCATGCCAG 
ACATTGTGCC 
TGATATTTCT 
TGTTAATCAT 
AATCCAAAGT 
TTTGCCTGTT 
TGTAAAATAC 
GAAG6AAAAA 
TAACTTTTAT 
TGCTAAAAAG 
CTGAATGGGG 
TGCTTTCTAT 
TTTTTATGAG 
TGAAGTTATA 
GCCATAACAG 
CTAGTCACAT 



41 



GGCGCCGATT 
AAGATGAATT 
GCTCAAACAG 
CAAGCAGGGC 
ACTTCTGCAC 
ATAGAAAATC 
AAAGGAACAG 
GTTTTGCGAT 
GACTATCCCA 
GAGAATGTAA 
TTCAGAATTG 
CCAGCTAAGC 
AAAAATGTGC 
CGCATATCTG 
GTTTGTGGAT 
GCCGGGTTTC 
CAATGTCACC 
GCTCACCTTG 
GAATTTCAGC 
TTATCTGCAA 
TCAGAAGTCA 
TACTGCAAGA 
ATTGGAGATG 
TCTGACAGCT 
TACATCTGTG 
GGAAATGGGA 
TGTGAATGCA 
TU^CAGTTCAG 
AAGAGGGATA 
T6TGATAGAT 
GAGTGCAACC 
GAA6CCAGCA 
TGTACAGATC 
TGTGCTGAGC 
ACATGCACAC 
CCCCAGCCGG 
TGGTTCTATT 
AATCCAGAGT 
ATTGTTCTTA 
AGAAGGGAGT 
AATCCTATTT 
GTACTGCCCG 
TAGGTAGCTT 
AATATGTATA 
GGACTGACAA 
TTTTTGACCT 
ATAGCGATTG 
GTATTAAAAC 
AAATGTCCT6 
A6ACAT6ACT 
GTTTGAAATA 
CCAATAGCTT 
GCCTTCACTT 
TCCTTGATTT 
ACCTTTTGAG 
CACCTCTTCT 
TACTTTTTCT 
CTGTGGCTAT 
ACCACTGTAT 
TCTTGTTTTA 



51 

I 

GCCGTACCAA 
TACAACCAAT 
ATGAAAATAG 
CAAATTGTGG 
GATGTGATGA 
CCAGAGGCTC 
CAGAGAAGCT 
TAAGATCAGG 
TTGACCTCTA 
AAAGTCTTGG 
GATTTGGCTC 
TCAGGAACCC 
TCAGTCTTAC 
GAAATTTGGA 
CACTGATTGG 
ACTTTGCTGG 
TGGAAAATAA 
TCCAGAAACT 
CTGTTTACAA 
ATTCTAGCAA 
TTTTGGAAAA 
ACGGGGTGAA 
AGGTTCAATT 
TTAAAATTAG 
AATGTGAATG 
CATTTGAGTG 
GCACAGATGA 
AAATCTGCAG 
ATAGAAATGA 
CCAATGGCTT 
CCAACTACAC 
ACGGACAGAT 
CGAAGTTTCA 
ATAAA6AAT6 
AGGAATGTTC 
TCCAACCTGA 
TTACGTATTC 
GTCCCACTGG 
TTGGCCTTGC 
TTGCTAAATT 
ATAAGAGTGC 
TGCAAATCCC 
TAGGGCAATA 
ATTTTTAAAA 
AAGACTTGAG 
TTTCTTCCTG 
AAAGGGCAAT 
TGATTTTTAG 
CTAGCTAGTT 
GATGACATAT 
GTTGATCTAC 
TAAAACCT6T 
TACAAATTCA 
AGCACTATTT 
TTGAATTTAT 
AATCTTTTAA 
TTGAAGTTTT 
GCAACAGCTC 
GTTTACTTCT 
AGTGCCTTTA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 



Seq ID NO: 8 Protein secruence; 
Protein Accession #: NP 002202 



7^ 1 . 11 21 31 41 51 

75 I I I I I 1 

MNIiQPlFWIG LISSVCCVFA QTDENRCIiKA NAKSCGECIQ AQPNCX5WCTN STPLQEGMPT 



60 



182 



NSOOCID: <WO 0207e482A2. 1. > 



wo 02/079492 



PCT/US02/04915 



SARCDDLEAIi KKKGCPPDDI BNPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQIiV 120 

LRIiRSGEPQT FTLKPKRABD YPIDLYYLMD LSYSMKDDLE NVKSI^TDLM NEMRRITSDF 180 

RIGFGSPVBK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNBLVGKQR 240 

ISGNIiDSPEG GFDAIMQVAV CGSLIGWRNV TRIiLVFSTDA GFHFAGDGKL GGIVLPNDGQ 300 

CHLENNMYTM SHYYDYPSIA HIiVQKIiSBNN IQTIFAVTBB FQPVYKELKN LIPKSAVGTL 360 

SANSSNVIQIi IIDAYNSIiSS EVILENGKLS EGVTISYKSY CKNGVNGTGB NGRKCSNISI 42 0 

GDEVQFEISI TSNKCPKKDS DSPKIRPLGP TEEVEVILQY ICECECQSEG IPESPKCHEG 4 80 

NGTFECX3ACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 540 

RDNTNBIYSG KPCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSLDTSTCE 600 

ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR APNKGEKKDT 660 

CTQECSYFNI TKVESRDKIiP QPVQPDPVSH CKEKDVDDCW FYFTYSVNGN NEVMVHWEN 720 

PECPTGPDII PIVAGWAGI VLIGIALLLI WKLLMIIHDR REFAKPEKEK MNAKWDTGEN 780 
PIYKSAVTTV VNPKYEGK 

Seq ID NO: 9 Nucleotide sequence: 
Nucleic Acid Accession #:NM_002425 

Coding sequence: 23.. 1453 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 24 0 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCC3X3ACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 4 80 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 13 80 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC AT ATTAA AGA GTAACAGCTG 144 0 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAT^ TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTG CAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTOT TATTACTTCC TCAATAAAAA GTTTTATTTT GG6CCTGTTC 1740 
CTT 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NP_002416 

1 11 21 31 41 51 

iwHIiAFLVIiL CIiPVCSAYPIj SGAAKEEDSN KDIiAQQYLBK YYNIiEKDVKQ FRRKDSNLIV 60 
KKIQGMQKFL GLBVTGIOiDT DTIiEVMRKPR CGVPDVGHPS SFPGMPKWRK THLTYRIVNY 120 
TPDLPRDAVD SAIEKALKVW BEVTPLTFSR LYBGBADIMX SFAVKEHGDF YSFDGPGHSI* 180 
AHAYPPGPGL YGDIHFDDDE KWTEDASGTN LFLVAAHELG HSLGIiFHSAN TETVLMYPIiYN 240 
SFTELAQFRIi SQDDVNGIQS LYGPPPASTE EPLVPTKSVP SGSEMPAKCD PAIiSPDAIST 300 
LRGEYLFFKD RYFWRRSHWN PEPEFHIiISA EWPSLPSYLD AAYEVNSRDT VFIFKGNEFW 360 
AIRGNEVQAG YPRGXHTIiGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRF DENSQSMEQG 420 
FPRLIADDFP GVEPKVDAVIj QAFGFFYFFS GSSQFEFDPN ARMVTHILKS NSWIiHC 



Seq ID NO: 11 Miifrleoti de sequence; 
Nucleic Acid Accession #: XM_058189 

Coding sequence: 169.. 774 (underlined sequences correspond to start and stop 

1 11 21 31 41 51 

GAAGACCAGC TCAGCTCTTC AGTTGTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60 



183 



wo 02/(»79492 



PCT/US02/049I5 



10 



15 



20 



25 



30 



AGTATTTCAC 
TATGAAGTGA 
AAGTGTGGAG 
AACATATTAT 
AACTACGTGT 
ACAGTTCTTC 
AGCAAAAAAT 
GGATACTGCC 
GATGGCTGGG 
TGGATTCAGT 
CTCATAACCC 
TCCAAGATAC 
ACAAAATGTT 
GACTTGAGGG 
TTTGCAGTCC 
AATATGCATC 
G7U\TCTGTAC 
TGCAACATTA 
AGGATTAAGT 
ATAAAAAATA 



ACTGAGATTG 
AGGGCTCTGA 
GCTGCCTAAG 
TGTATTTCCC 
GGTATTTTGA 
TGGTACTGGA 
ATGTGACACT 
TGGTCATCTC 
AGTATGCTTT 
GCCTGGAACC 
TCAGTGGGCT 
TGTGTGGAAG 
TTCCATTATC 
CMTATTGAA 
TCACTGCACA 
AGAAACTTCA 
TAGGGAAGTA 
TTTAATATTC 
AAAGAGT6GT 
AAGAGTACTA 



TCGGCTGCGG 
CCCTGGAAGT 
TTGTTTGCTG 
GAATGGGCAA 
AGGAATCTGT 
GAATAATAAC 
GCTGTCAATT 
TGCCTTGGGT 
TGAAGGCACT 
TGCACATGTT 
TCAAGTGATC 
CTATTCAGTG 
AAGACATGGC 
ATGATGGTGC 
TGCAAGTATA 
GAAATACTTC 
AATAAGAATA 
TGGAAAATT6 
ACATACTGTA 
CATGGTTGTA 



GTATATTCCA 
GGTTCTAAGC 
ATTCCGCTTG 
ACTTCCTATG 
TTCTCAGGCA 
AACTATAAAT 
ATCTTTTCTT 
CTTGTCCAAG 
GCTGGACGTT 
GTGGAGTGGA 
ATCTGCCTCA 
ATCTTCCAGC 
CATCTATCTA 
TTTCTGCATT 
CCACCCTTCC 
TGCCCTTT6A 
T6AGAGAAAC 
GAAACACCCC 
AATGTTTTCT 
AAA 



ATTCCCCGTC 
AGGGCAAAAT 
CACTTTGGAG 
CATCCAGCAA 
TCATGATGCT 
GTTGCCAGAG 
CCCTCGGAAT 
GGCCATATTG 
TCCTTACAGA 
ACATCATTTT 
TCAGAGTAGT 
CTGGAATCAT 
AATATTATAT 
TGGTGTTTAT 
ATTTAGTATG 
TCAAACAAAT 
CTTTATGCAA 
AAAATTCTAA 
GATATTAAAA 



TCCTCATGAA 
^GGGTCTCGG 
TATAATOGTG 
TAAACTCACC 
TATAGTAACA 
TGAAAACTGC 
TGCTTTTTCT 
CCGCACCCTT 
TTCTAGCATA 
ATTTTCCATT 
CATGCAACTA 
TTG^TAAGG 
CAACTGTGTA 
TTGTAAAAAA 
TTTTTTAAGT 
CCATTTCCAA 
ATATGTATAT 
ACTCAGAG6A 
AAAAAATTAA 



Seq ID NO: 12 Protein aemienr^*>. 
Protein Accession #: XP 056X89 



41 



51 



120 
180 
240 
300 

360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



1 11 21 31 

I I I I . . 

MGSRKCGGCL SCLIiIPIALW SIIVNIia.YF PNGQTSYASS NKLTNYVWYF EGICFSGIMM 60 
UIVTTVLIiVL ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG lAFSGYCLVI SALGLVQGPY 120 
CRTLDGMEYA FEGTAGRFLT DSSIWIQCiB PAHWBWNII IiFSILITLSG LQVIICLIRV 180 
VMQIjSKIIiCG SYSVIFQPGI I 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Seq ID NO: 13 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005397 

Coding sequence: 251. .1837 (underlined sequences correspond to start and stop codons) 



1 
I 

AAACGCCGCC 

CAGCCCGGCT 

CGGGCCACAG 

TACCGCCCGG 

CGACGACACG 

GCCGCTGCTG 

CCAGACTACT 

CATGGCTACA 

GGCCTCGGTC 

GGCTCAGCAA 

CCCTACTACC 

AACCTCCACA 

AACAAACTCT 

AGAACATCTG 

GCATCCTGTG 

TTCAAGCACT 

ACCGTCATCG 

TACGGCCCCT 

ACCTACCCTG 

CCCCAAAACA 

TGAGACACAG 

TGCAGGGGGC 

CTTCAACCCG 

CGTGGTCGTC 

GCTGAAGGAC 

CCAGGGGCCA 

CTGCATGGCG 

CTCCCAGA6G 

CCATGACAAC 

GGTCAGCCTC 

GGAC6ACCTG 

CAGCACCACA 

CTGGGGAGGG 

TTTCCCTTTT 

CTAGTGCCTG 

GACTTTATAG 

GAGGGTAAGT 



11 

I 

CAGGACGCAG 

CTGCTGCAGC 

CCTGGCCTCC 

ACGCGCGGAT 

ATG CGCTGCG 

CCGTCGTCGC 

ACGGACTCAT 

GATACAGCCC 

AAGGCGACCA 

GTCTCAGGCC 

ACCATCGAGA 

GCCACAGCTA 

GGGGGGAAAA 

ACGACXICCTC 

GCCACCCCAA 

GTGGCTATCC 

GTTATCTCGC 

TCCTCCCAGG 

CCAGAGACCA 

CCTTCTCCCA 

ACACAGAGTG 

GCTTCGGATG 

GCCCAAGATA 

AAAGAAATCA 

AAATGGGATG 

CCGGAGGAGG 

TCATTCCTGC 

AAGGACCAGC 

CCAACACTGG 

AACGGGGAGC 

GATGAGGAGG 

GAGCTCCAGA 

AGAGTGAACT 

CAACCTGAAC 

AGCTCA6TGC 

ATGAACTAGT 

GACTTGCCCA 



21 

I 

CCGCCGCCGC 

GGCAGGGAGG 

GGAGCCACCC 

CCTCCGCCGG 

CGCTGGCGCT 

CGTCX3CCGTC 

CTAACAT^AAC 

AGCAGAGCAC 

CCCTTGGTGT 

CAGTCAACAC 

GCCCCAAGAG 

AACCTAACAC 

GCAGCCACAG 

ACCCTACAAG 

CAA6CTCGGG 

CTGGCTACAC 

AAAGAACTCA 

AGACAGTGCA 

TGAGCTCCAG 

CTGTGGCTCA 

AGAAGCAGCT 

AGAAATTGAT 

AGTGCGGCAT 

CTATTCACAC 

AACTAAAGGA 

CCGAGGACCG 

TCCTCGTGGC 

AGCGGCTAAC 

AAGTGATGGA 

TGGGGGACAG 

AAGACACACA 

CCAACCACXrC 

CCGAGGGGTQ 

AAATCACATT 

TGCTGGATGA 

GGAATCCCTT 

AGGTCAGAGC 



31 
I 

CGCCGCTCCT 
AAGAGCCGCC 
ACAGGCCTCC 
CACCGCAGCC 
CTCGGCGCTG 
GCCGTCGCCG 
AGCACCGACT 
AGTCCCCACT 
ATCCAGTGAC 
TACCGTGGCT 
CACAAAAAGT 
CACAAGCAGC 
TGTGACCACA 
TCCACTTAGC 
ACAT6ACCAT 
CTTCACAAGC 
ACAGACCTCC 
GCCCACX^GC 
CCCCACAGCA 
TGAGAGTAAC 
CGTCCTGAAC 
CTCACTGATA 
ACGGCTGGCA 
TAAGCTCCCT 
GGCAGGGGTC 
CTTCAGCAT6 
GGCCCTCTAT 
AGAGGAGCTG 
GACCTCTTCT 
CTGGATCGTC 
CCT CTAGT CC 
CAAGTGCCGT 
TCCCCTCCCA 
CTGTCCAGAT 
TQAGGGAGAT 
CATTCTGCAG 
CACTTGGTGA 



41 
I 

CTGCCACTGG 

GCAGCGCGAC 

CCGGGCGGCG 

ACCTGCTCCC 

CTGCTACTGT 

TCGCCCTCCC 

CCAGCATCCA 

TCCAAGGCCA 

TCACCGGGGA 

AGAGGAGGCG 

GCAGACACCA 

CAGAATGGAG 

GACCTCACAT 

CCCCGACAAC 

CTTATGAAAA 

CCGGGGATGA 

AGTCAGATGC 

CCGGCAACGG 

GCATCAACTA 

TGGGCAAAGT 

CTCACAGGAA 

TGCCGAGCAG 

TCTGTTCCAG 

GCCAAGGATG 

AGTGACATGA 

CCCCTCATCA 

GGCTGCTGCC 

CA6ACAGTGG 

GAGATGCAGG 

CCTCTGGACA 

GGTCTGCCX3G 

TTGGATGGGG 

ATCCCCCCAG 

TCCTCTTGTA 

CAA6AAAAAG 

TGAGATTGCC 

CAGA6CCAGG 



51 
I 

CTCTGCGCCC 
TCG6GAGCCC 
CCCACGCTCC 
GGCCCAGAGG 
TGTCAACGCC 
AGAATGCAAC 
GTGTCACCAT 
ACGAAATCTT 
CTACAACCCT 
GCTCAGGCAA 
CTACAGTTGC 
CAGAAGATAC 
CCACTAAGGC 
CCACTTTGAC 
TTTCAAGCAG 
CCACCSVCCCT 
CAGCCAGCTC 
CATTGA6AAC 
CCCACCGATA 
6TGA66ATCT 
ACACCCTCTG 
TCAAAGCCAC 
GAAGTCAGAC 
TGTACGAGCX5 
AGCTAGGGGA 
TCACCATCGT 
ACCAGCGCCT 
AGAATGGTTA 
AGAAGAAGGT 
ACCTGACCAA 
TGGCCTCCAG 
AAGGGAAAGA 
GGCCTTAATT 
AAATAACCCA 
CCACGTAAGG 
GAGACCTGAA 
ATGAGAACAA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1960 
2040 
2100 
2160 
2220 
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AGATTCCATT TGCACCATGC CACACTGCTG 
CCCGG6CAGG GGTGAAACTC CAGCAGGTGG 
CTGGCTCGGT GGGATCTGAC GACCTGAAAG 
CAGCCTCGTG TACCCATCTT CCCACCCTCT 
5 TGTTCCCACT TACTCTGTCC TGGGGCCTCT 
ACCCCTTGTT CTGGATTCGG ATTTTCTCAC 
CACAGGTCTC CGTGCGTGAA CCAGGTCTGC 
AGGGGACACT CGAGTCCAGG CTGGTATCTC 
CTGGCCCATT GCCCCTGGCA CTCCTTGCAG 
10 TTTCCACCAG GGACTAAAAT CTGCTGTAGC 
CACTGCTCAG CTGATGGGAG TGATTCCCTG 
GAGAGGCCTT CCCGGGGCCC CCCATCAGCG 
GCCAGGAAGG ACAGGCTTTC CCCTGTATAT 
TGAGAGTTTT GTTATAGTTG TTTTCTAACC 
15 AGACAGTTGG CAGACGTCCA GTTAGCTCAT 
GAAAAGAGGG CCTGGAAAGC GCAGTGCATG 
GACTGTTGTG GGAACGTCCC ACTGTCCTGG 
CTGTGAGCCT AGTG6GGCTG CTGCGGTTCT 
CACAAAAGCT ATGTCTATTC CCCAATATAG 
20 TGTAGAAGGC TCCTAAGCAG TGGGCATGGT 
CTGTGTGGAA TGTTGAACTT TCGCCATGGT 
ATCAAGTAGG AAAATGG6CA GAACT6CTTA 
AC6CCACACC TCCAGGGTCT TAAGAGTCAG 
GCTATCCACT CGGGATGGCT TACTTTTTAA 
25 GTCCTGCACC ATCTGCCTAA TTCCTTCCTC 
GAAAAGGAAG GCCAGGGGTT CACATAGGGC 
GCGAGGCTAA CAAGTTCCAA AAACATCTGC 
GATGGAGAAC AGTGCCTGTT TGGGGGAAAA 
TTTGCTTCCT TCTTGCCCAG CGCTGGGTTC 
30 CCGTGGGAGA GGCCAGACTG GATTATTCCT 
GCCAGGGGTT TTGACGGAGA CAGCAAATAG 
CTATGGCCTC TTGGAGACAG ATGATGACTG 
AGGTCGGTCC TGACTCTCCT GACTCTCCAT 
CTGGGCTGAT TCAGAGGTTA CTGCTTTATA 
35 GACAGCTTCA GAATCTGACA CCTTGCCTTG 
GCCAAACAGC CATGCATCTA TAAAGGTCAT 
TGCTCTCTGA TAATTCAGAG AGCATTGGGT 
TCAGCATGAC TTAAACAGGT TGTAGCAAAG 
ACTGTGGTTT CCCCAAGCTG CACAGGAGGC 
40 CTACTGTCAT GAGAGTG6GG AGACAGGCAG 
TTTGCGTTGT AAGACAGAAT ACG66TTTAA 
TTGATAAGGA AAGCTAGCAG AAAGTTTATT 
ACAATATACT GGAGAAACTT TGAAGAACAA 
TTGATAATGT AAATACAGTG ACCATGTTAA 
45 TTTGAAAAAG CATTATGTTA GCTGAGTGAT 
AATGTCTTAC TGGAAATGAC AAGTTTTTGC 
TATAACAAGA CAAACTTATG ATAAAGTATT 
TTTTTAATTT TAAAATGCAA CCCTGCCCCC 
AAAGGTTGGA GTCAATATGC TCTGGTTGGC 
50 TTCAAGATCT AGTCCAATCT TTTTCTAGAG 
AGTGACTTCC TCAAAATCAC ATGGTTCAGG 
GACTGTGCGC CTCAGAAGGA ATAATCGGTA 
ATGACACAAA GGACAGAATT CCTTTCCCAG 
GAACACAAAC ATAAGAACTG GTCTTCTCAC 
55 TAAGTGAACA ATTCTTTCTT TCTGCCAAGA 
GAACTTACTC CAACAGGACT GAGGGACCAA 
AAAGAGTAAA ACTGTAGCAT AGCTTTTGTC 
GCAAACACAG CATGGAGGAC ACAGATGACT 
ATGTTCAACA GTTTGCCCAG GAACTGGGGG 
60 AAGTACACTG GAATTTACTG AGAAACTTGT 
TTTTCTTACA AAAATATATT TTGGAAAATT 



TGTTCACATG TGCCTTCCGT CCAGAGCAGT 2280 
CTGGGCTGGA AAGGAGGGCA GGGCTACATC 234 0 
TCCAGCTCCC AAGTTTTCCT TCTCCTACCC 2400 
ATGTTCTTAC CCCTCCCTAC ACTCAGTGTT 2460 
GGGATTAGCA CAGGTTATTC ATAACCTTGA 2520 
ATTTGCTTCG TGAGATGGGG GCTTAACCCA 2580 
TTAGGGGACC TGCGTGCAGG TGAGGAGAGA 2640 
AGGGCAGCTG ATGAGGGGTC AGCAGGAACA 2700 
AGGCCACCCA CGATCTTCTT TGGGCTTCCA 2760 
TAGTGAGAGC AGCGTGTTCC TTTTGTTGTT 2820 
AGACCCAGTA TGAAAGAGCA GTGGCTGCAG 2880 
ATGTGTCTTC AGAGACAATC CATTAAAGCA 2940 
CATAGGAAAC TCAGGGACAT TTCAAGTTGC 3000 
CAGCCCTCCA CTGCCAAAGG CCAAAAGCTC 3060 
CTCACTCACT CTGATTCTCC TGTGCCACAG 3120 
CTGGGTGCAT GAAGGGCAGC CTGGGGGACA 3180 
CCTGGAGCTA GGCCTTGCTG TTCCTCTTCT 3240 
CTTGCAGTTT CTGGTGGCAT CTCAGGGGAA 3300 
GACTTTTATG GGCTCGGCAG TTAGCTGCCA 3360 
6AGGTTTCAT CTGATTGAGA AGGGGGAATC 3420 
CTCCATCGTT CTGGGCGTAA ATTCCCTGGG 3480 
GGGGAATGAA ATTGCCATTT TTCGGGTGAA 3540 
GCTCCGGCTG TAGTAGCTCT GATGAAATAG 3600 
AAGGGTAGGG GGAGGGGCTG GGGAAGATCT 3660 
ACAGTCTGTA GCCATCTGAT ATCCTAGGGG 3720 
CCCAGCGAGT TTCCCAGGAG TTAGAGGGAT 3780 
CCCGATGCTC TAGTGTTTGG AGGTGGGCAG 3840 
CAGGAAATCT TGTTAGGCTT GAGTGAGGTG 3 900 
TCTCCACCCA GTAGGTTTTC TGTTGTGGTC 3960 
CCTTTGCTGA TCCTGGGTCA CACTTCACCA 4020 
GCCTCTGCAA ATCAATCAAA GGCTGCAACC 4080 
GCAAGGACTA GAGAGCAGGA GTGCCTGGCC 4140 
CGCTCTGTCC AAGGAGAACC CGGAGAGGCT 4200 
TTCGTCCAAA CTGTGTTAGT CTAGGCTTA6 4260 
CTCTTGCCAC CAGGACACCT ATGTCAACAG 4320 
CATCTTCTGC CACCTTTACT GGGTTCTAAA 4380 
CTGGGAAGAG GTAAGA6GAA CACTAGAA6C 4440 
ACAGTTTATC ATCAACTCTT TCAGTGGTAA 4500 
CAGAAACCAC AA6TATGATG ACTAGGAAGC 4560 
CAAA6CTTAT GAAGGAGGTA CAGAATATTC 4620 
TCTAGTCTAG GCRCCAGATT TTTTTCCCGC 4680 
TAJ^CCACTT CTTGAGCTTT ATCTTTTTTG 4740 
GTTCAAACTG ATACATATAC ACATATTTTT 4800 
CCTACCCTGC ACTGCTTTAA GTGAACATAC 4860 
GGCCAAGTTT TTTCTCTGGA CAGGAATGTA 4920 
TTGATTTTTT TTTTTAAACA AAAAATGAAA 4980 
TGTCTTGTAG ATCAGGTGTT TTGTTTTGTT 5040 
TCCCCAGCAA AGTCACAGCT CCATTTCAGT 5100 
AGGCAACCCT GTAGTCATGG AGAAAGGTAT 5160 
AAAAAGATAA TCTGAAGCTC ACAAAGATGA 5220 
ACAGAAACAA GATTAAAACC TGGATCCACA 5280 
AATTAAGAAT TGCTACTCGA AGGTGCCAGA 5340 
TTGTTACCCT AGCAAGGCTA GGGAGGGCAT 5400 
ACTTTCTCTG AATCATTTAG GTTTAAGATG 5460 
AACAAAGTTT TGGATGAGCT TTTATATATG 5520 
GGAAACATGA T6GGGGAGGC AA6AGAG6GC 5580 
ACGGTCACTA GCTGATCCCT CAGGTCTGCT 5640 
CTTTQGTGTT GGTCTTTTTG TCTGCAGTGA 5700 
ATCATATATG TCTTAGTGGA CAGGGGTCTG 5760 
TTGTAAAAAC TATAGTTAAT AATTATTGCA 5820 
GTATACTGTC AATTAAAGT 



Seq ID NO: 14 Protein sequence; 
Protein Accession #: NP 005388 

1 11 21 31 41 51 

I I i I I I 

MRCAIALSAL LLLLSTPPLL PSSPSPSPSP SPSQNATQTT TDSSNKTAPT PASSVTIMAT 60 
DTAOQSTVPT SKANEILASV KATTLGVSSD SPGTTTIiAQQ VSGPVNTTVA RGGGSGNPTT 120 

70 TIESPKSTKS ADTTTVATST ATAKPNTTSS QNGAEDTTNS GGKSSHSVTT DL TSTKA EHL 180 
TTPHPTSPIiS PRQPTLTHPV ATPTSSGHDH IiMKISSSSST VAIPGYTFTS PCT«TTTI»PSS 240 
VISQRTQQTS SQMPASSTAP SSQETVQPTS PATALRTPTL PBTMSSSPTA ASTTHRYPKT 300 
PSPTVAHESN WAKCEDIiETQ TQSEKQLVtN LTGNTIiCAGG ASDEKLISIiI CRAVKATFNP 360 
AQDKCGIRIA SVPGSQTVW KEITIHTiCLP AKDVYERLKD KMDEIiKEAGV SDMKIX3DQGP 420 

75 PEEAEDRPSM PLIITIVCMA SPIiLVAALY GCCHQRIiSQR KDQQRI^TBEL QTVBNGYHDN 480 
PTIiEVMETSS BMQBKKWSL NGBLOTSWIV PLDNLTKDDIi DBEEDTHIj 
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Seq ID NO: 15 Nucleotide aecmence: 
Nucleic Acid Accession #: NM_004105 

Coding sequence: 150.. 1631 (underlined sequences correspond to start and stop codons) 
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CTAGTATTCT 
AAAATAAAAA 
CAAGGTAACT 
CTCTGGCGCT 
ACGGATATGA 
TCCCAGACGC 
TTCCGAAAAC 
CAGAAGGAAC 
GAGTGTTGCC 
AGACTGGCCG 
CCAACCCTTC 
GCCAAGACAT 
TCAATTTACG 
AGTGCGTAGA 
CACCAGGCTC 
CCTGCGTAGA 
TTCTTGGTTC 
ACTGTGAAGA 
ATGAACCTGG 
CATGTCAAGA 
GGAATTATCA 
TAACACCAGA 
AGTCAATAGT 
TCCAGATACA 
GAZUVTGAAAA 
TCGTGAAGTC 
GCAGTATAGG 
TTTCATTTTA 
TTACCTTAAA 
ACACTCACCC 
TTCAAAGTTT 
TTCAAGAGAG 
ACCAAGCAAT 
CTCATAAGGA 
AACTGCTTTG 
TCTGCCATAT 
ATAGTAATAA 
AGAAATTTTA 
AAAATGAACC 
AGAAAATAAA 
ATAATAACAT 
TTCACTGGAG 
AACTAAAATT 
TATTCCTGGC 
GGAATATTTT 
TTTAAATAAA 



11 
I 

ACTAGAACTG 

GCTTGAGCAG 

CTGCTAGCTA 

GGTCAAGTCA 

GTGG6ATCCT 

TTGTAAAGGT 

AGCCCAGATT 

CTCAG6GGCA 

CG6GGGTGGT 

AAATAACTTT 

CCACCGTATC 

AGACX5AGTGC 

GGGATCCTTT 

CATAGATGAA 

ATTTTATTGC 

TATAAATGAA 

ATTCATCTGT 

CATTGATGAA 

GAAATTCTCA 

TATAAATGAG 

TGGCGGCTTC 

GAACCGATGT 

CTACAAATAC 

GGCCACAACT 

TGGAGAGTTC 

ATTATCA6GA 

GACCTTCCGC 

gTCTTTTCTA 

GCACTATTTT 

ATAACAAACA 

GTCTTTATTA 

CTAAGTATAC 

GATGATCTTC 

GGCAGCCATC 

TAAGAAAATG 

TTGTGTTGGT 

ATCATTGCTG 

GATTGTGAAT 

AGATACCCCC 

TTCCTTCTAA 

CCCTGAATTC 

AAGTTTTAAT 

TCTCATCTTC 

ATCCAQAGTG 

ATACGTCTCX: 

AATAAATATT 



21 
I 

GAAGATTGCT 
CAATTCATAT 
AGATTCACAA 
CAGGACACCG 
6T6AGACAGC 
GGAAT6AAGT 
ATTGTCAATA 
ACCACCGGGG 
TTTGTGGCCA 
GTCATCCX3GC 
CAGTGTGCAG 
ACTGCAGGGA 
GCATGTCAGT 
TGTACCATCC 
CAGTGCAGTC 
TGTGATGCCA 
CAGTGCAATC 
TGCAGAACCT 
TGTATGTGCC 
TGTGAGACCA 
CGTTGTTATC 
GTTTGCCCAG 
ATGAGCATCC 
ATTTATGCCA 
TACCTACGAC 
CCAAGAGAAC 
ACAAGCTCTG 
AGAGTCAACC 
ATTTATAGAT 
ATTACACCAT 
CTATATGTAA 
ACTATCTGGT 
TGTGGTGCTT 
ATAACCATTG 
GAAAAGGTCA 
TTTTATTTTC 
TACAACATGC 
ATTTTGTAAA 
TAGAAAATTA 
ACCACATTGG 
AGGCATTCAC 
TTCTAAGTAA 
GTACTTGATG 
ACAGTGAACT 
TTGTTTAAAA 
CCTTTAGAAG 



31 
I 

CTCCGAGTTT 
TACTGTCACA 
TGTTGAAAGC 
AAGAAACCAT 
AATGCAAAGA 
GTGTCAACCA 
ATGAACA6CC 
TT6TA6CTGC 
GTGCTGCTGC 
GGAACCCAGC 
CAG6CTAC6A 
CGCACAACTG 
GCCCTCCTG6 
CTCCATATTG 
CTGGGTTTCA 
GCAATCAATG 
AAGGATATGA 
CAAGCTACCT 
CCCAGGGATA 
CAAATGAATG 
CACGAAATCC 
TCTCAT^TGC 
GATCTGATAG 
ACACCATCAA 
AAACAAGTCC 
ATATCGTGGA 
TGTTAAGATT 
ACAGGCATTT 
ATATCTAGTG 
GGTATAAAGT 
ATTAGACATT 
GAAACTTGGA 
AAG6AAACTT 
AATAGCATGC 
ATAAAGATAT 
ATATCCAGCC 
TGGTTTCTGT 
AAACAGTAAG 
TACTATTGAG 
AACTGACCTG 
AAGATGCAGA 
AATTTAAATC 
CTCACAGAGG 
TAAGCAAATT 
TCTGACTGCT 
•ATCACTCTAA 



41 

I 

TTTTTTTGTT 

GGTATTTTTG 

CCTTTTCCTA 

CACGTACACG 

TATTGATGAA 

CTATGGAGGA 

TCAGCAGGAA 

CAGCA6CATG 

AGTCGCAGGC 

TGACCCTCAG 

GCAAA6TGAA 

TA6AGCAGAC 

ATATCAGAAG 

CCACCAAAGA 

ATTGGCAGCA 

TGCTCAGCAG 

GCTAAGCAGT 

GTGTCAATAT 

CCAAGTGGTG 

CCGGGAGGAT 

TTGTCAAGAT 

CATGTGCCGA 

GTCTGTGCCA 

TACTTTTCGG 

TGTAAGTGCA 

CCTGGAGATG 

GACAATAATA 

AAGTCAGCCA 

CaVTCTACATC 

6GGCATTTAA 

AATCCACTAA 

TTCTTTCCTA 

ACTAGAGCTC 

AAGGGTAAGA 

ATTTCTTTAG 

TAAAGGTGGT 

AGGGTATTTT 

CAAAATTTTC 

AAATCTATGG 

AAGAAGCAAA 

ACAAAATGGA 

CTAACACTTC 

AAGAAAATGA 

ACCCTCCTAC 

TTACTTTGAT 

AA 



51 
I 

ATTTTGTTAA 

CTGTGCTGTG 

ACTATGCTGA 

CAATGCACTG 

TGTGACATTG 

TACCTCTGCC 

ACACAACCAG 

GCAACCAGTG 

CCTGAAATGC 

CGCATTCCCT 

CACAAC6TGT 

CAAGTGTGCA 

CGAGGGGAGC 

TGCGTGAATA 

AACAACTATA 

TGCTACAACA 

GACAGGCTCA 

CAATGTGTCA 

AGAAGTAGAA 

GAAATGTGTT 

CCCTACATTC 

GAACTGCCCC 

TCAGACATCT 

ATTAAATCTG 

ATGCTTGTGC 

CTGACAGTCA 

GTGGGGCCAT 

AAGAATATTG 

TCTATACTGT 

TATGTAAAGA 

ACTGGTCTTC 

TAAAAGTGG6 

CACTAACAGT 

ATGAGTTTTT 

AAAATGGGGA 

TGTTTATTAT 

TAATTTTGTC 

CAGAATTCCC 

GGAGGATATG 

CTCGGAAAAT 

TAAAAGGTAT 

ACTAATTTAT 

TGATGGTTTT 

CCAATTCTAT 

GTATCATATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



Seq ID NO: 16 Protein sequence; 
Protein Accession #: NP 004096 



60 



65 



70 



MLKALFLTMIi 
CVNHYGGYLC 
SAAAVAGPEM 
THNCRADQVC 
PGFQIiAANNY 
SSYLCQYQCV 
PRNPCQDPYI 
NTINTFRIKS 
VLRLTIIVGP 



11 

I 

TLALVKSQDT 
LPKTAQIIVN 
QTGRNNFVIR 
INLRGSFACQ 
TCVDINECDA 
NEPGKFSCMC 
LTPBNRCVCP 
GNEN6EFYLR 
FSF 



21 
I 

EETITYTQCT 
NEQPQQBTQP 
RNPADPQRIP 
CPPGYQKRGE 
SNQCAQQCYN 
PQGYQWRSR 
VSNAMCRELP 
QTSPVSAMLV 



31 

I 

DGYBWDPVRQ 
AEGTSGATTG 
SNPSHRIQCA 
QCVDIDECTI 
ILGSFICQCN 
TCQDINECET 
QSIVYKYMSI 
IiVKSLSGPRE 



41 

I 

QCKDIDECDI 
WAASSMATS 
AGYEQSEHNV 
PPYCHQRCVN 
QGYELSSDRIi 
TNECREDBMC 
RSDRSVPSDI 
HIVDLEMIiTV 



51 
I 

VPDACKGGMK 
GVIjPGGGFVA 
CQDIDECTAG 
TPGSPYCQCS 
NCEDIDECRT 
WNYHGGFRCY 
FQIQATTIYA 
SSIGTFRTSS 



60 
120 
180 
240 
300 
360 
420 
480 



75 



Seq ID NO: 17 Nucleotide secmene^ . 
Nucleic Acid Accession #: NM_018894 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 



186 



WSDOCID: <WO 02079402A2 .!_> 



wo 02/079492 



PCT/US02/04915 



1 11 21 31 41 51 

I I I I I I 

AAAACATTCA ACAAATTAAT GGGTGTAAGG AACTGGAAAA CCTGGACTCC TACCACATGC 60 

5 AGATAAAACC AATAGAGTGC AGAATAAGAC TCAAGTCAAG TAAGTAACGT TAAACACCAT 120 

AAAGACACAT GGCCTTCTTT GTGTACATGA CATGCATTCT CAACAATGCA CTGACGGATA 180 

TGAGTGGGAT CCTGTGAGAC AGCAATGCAA AGATATTGAT 6AATGTGACA TTGTCCCAGA 24 0 

CGCTTGTAAA GGTGGAATGA AGTGTGTCAA CCACTATGGA GGATACCTCT GCCTTCCGAA 300 

AACAGCCCAG ATTATTGTCA ATAATGAACA GCCTCAGCAG GAAACACAAC CAGCAGAAGG 360 

10 AACCTCAGGG GCAACCACCG GGGTTGTAGC TGCCAGCAGC ATGGCAACCA GTGGAGTGTT 420 

GCCCGGGGGT GGTTTTGTGG CCAGTGCTGC TGCAGTCGCA GGCCCTGAAA TGCAGACTGG 4 80 

CCGAAATAAC TTTGTCATCC GGCGGAACCC AGCTGACCCT CAGCGCATTC CCTCCAACCC 540 

TTCCCACCGT ATCCAGTGTG CAGCAGGCTA CGAGCAAAGT GAACACAACG TGTGCCAAGA 600 

CATAGACGAG TGCACTGCAG GGACGCACAA CTGTAGAGCA GACCAAGTGT GCATCAATTT 660 

15 ACGGGGATCC TTTGCATGTC AGTGCCCTCC TGGATATCAG AAGCGAGGGG AGCAGTGCGT 720 

AGACATAGAT GAATGTACCA TCCCTCCATA TTGCCACCAA AGATGCGTGA ATACACCA6G 780 

CTCATTTTAT TGCCAGTGCA GTCCTGGGTT TCAATTGGCA 6CAAACAACT ATACCTGCGT 840 

AGATATAAAT GAATGTGATG CCAGCAATCA ATGTGCTCAG CAQTGCTACA ACATTCTTGG 900 

TTCATTCATC TGTCAGTGCA ATCAAGGATA TGAGCTAAGC AGTGACAGGC TCAACTGTGA 960 

20 AGACATTGAT GAATGCAGAA CCTCAAGCTA CCT6TGTCAA TATCAATGTG TCAAT6AACC 1020 

TGGGAAATTC TCATGTATGT GCCCCCAGGG ATACCAAGTG GT6AGAAGTA GAACATGTCA 1080 

AGATATAAAT GAGTGTGAGA CCACAAATGA ATGCCGGGAG GATGAAATGT GTTGGAATTA 1140 

TCATGGCGGC TTCCGTTGTT ATCCACGAAA TCCTTGTCAA GATCCCTACA TTCTAACACC 1200 

AQAGAACCGA TGTGTTTGCC CAGTCTCAAA TGCCATGTGC CGAGAACTGC CCCAGTCAAT 1260 

25 AGTCTACAAA TACATGAGCA TCCGATCTGA TAGGTCTGTG CCATCAGACA TCTTCCAGAT 1320 

ACAGGCCACA ACTATTTATG CCAACACCAT CAATACTTTT CGGATTAAAT CTGGl^AATGA 1380 

AAATGGAGAG TTCTACCTAC GACAAACAAG TCCTGTAAGT GCAATGCTTG TGCTCGTGAA 1440 

GTCATTATCA GGACCAAGAG AACATATCGT GGACCTGGAG ATGCTGACAG TCAGCAGTAT 1500 

AGGGACCTTC CGCACAAGCT CTGTGTTAAG ATTGACAATA ATAGTGGGGC CATTTTCATT 1560 

30 TTAGTCTTTT CTAAGAGTCA ACCACAGGCA TTTAAGTCAG CCAAAGAATA TTGTTACCTT 1620 

AAAGCACTAT TTTATTTATA GATATATCTA GTGCATCTAC ATCTCTATAC TGTACACTCA 168 0 

CCCATAACAA ACAATTACAC CATGGTATAA AGTGGGCATT TAATATGTAA AGATTCAAAG 1740 

TTTGTCTTTA TTACTATATG TAAATTAGAC ATTAATCCAC TAAACTGGTC TTCTTCAAGA 1800 

GAGCTAAGTA TACACTATCT GGTGAAACTT GGATTCTTTC CTATAAAAGT GGGACCAAGC 1860 

35 AATGATGATC TTCTGTGGTG CTTAAGGAAA CTTACTAGAG CTCCACTAAC AGTCTCATAA 1920 

GGAGGCAGCC ATCATAACCA TTGAATAGCA TGCAAGGGTA AGAATGAGTT TTTAACTGCT 1980 

TTGTAAGAAA ATGGAAAAGG TCAATAAAGA TATATTTCTT TAGAAAATGG GGATCTGCCA 2040 

TATTTGT6TT G6TTTTTATT TTCATATCCA GCCTAAAGGT GGTTGTTTAT TATATAGTAA 2100 

TAAATCATTG CTGTACAACA TGCTGGTTTC TGTAGGGTAT TTTTAATTTT GTCAGAAATT 2160 

40 TTAGATTGTG AATATTTTGT AAAAAACAGT AAGCAAAATT TTCCAGAATT CCCAAAATGA 2220 

ACCAGATACC CCCTAGAAAA TTATACTATT GAGAAATCTA TGGG6AG6AT ATGAGAAAAT 2280 

AAATTCCTTC TAAACCACAT TGGAACTGAC CTGAAOAA6C AAACTCGGAA AATATAATAA 2340 

CATCCCTGAA TTCAGGCATT CACAAGATGC AGAACAAAAT GGATAAAAGG TATTTCACTG 24 00 

GAGAAGTTTT AATTTCTAAG TAAAATTTAA ATCCTAACAC TTCACTAATT TATAACTAAA 2460 

45 ATTTCTCATC TTCGTACTTG ATGCTGACAG AGGAAGAAAA TGATGATGGT TTTTATTCCT 2520 

GGCATCCAGA GTGACAGTGA ACTTAAGCAA ATTACCCTCC TACCCAATTC TATGGAATAT 2580 

TTTATACGTC TCCTTGTTTA AAATCTGACT GCTTTACTTT GATGTATCAT ATTTTTAAAT 2640 
AAAAATAAAT ATTCCTTTAG AAGATCACTC TAAAA 

50 Seq ID NO: 18 Protein sequence; 

Protein Accession #: NP_061489.1 

1 11 21 31 41 51 

cc I I I ' ' ' 

55 MHSQQCTDGY EWDPVRQQCK DIDBCDIVPD ACKGGMKCVN HYGGYLCLPK TAQIIVNNBQ 60 

PQQETQPABG TSGATTGWA ASSMATSGVL PGGGFVASAA AVAGPEMQrTG RMNFVIRRNP 120 

ADPQRXPSNP SHRXQCAAGY EQSEHNVCQD XDECTAGTHH CRADQVCINL RGSFACQCPP 180 

GYQKRGEQCV DIDECTIPPY CHQRCVNTPG SFYCQCSPGP QLAANNYTCV DINECDASNQ 240 

CAQCJCYNIIiG SFICQCNQGY ELSSDRIiNCE DIDECRTSSY IjCQYQCVNEP GKPSCMCPQG 300 

60 YQWRSRTCQ DINECETTNE CREDEMCWNY HGGFRCYPRN PCQDPYILTP ENRCVCPVSN 360 

AMCRELPQSI VYKYMSIRSD RSVPSDIFQI QATTIYANTI NTFRIKSGNE NGEFYIjRQTS 420 
PVSAMLVIjVK SLSGPREHIV DLEMLTVSSI GTPRTSSVIiR LTIIVGPFSP 

Seq ID NO: 19 Nucleotide sequence: 
65 Nucleic Acid Accession #: NM_006500 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

70 I I I 1 I 1 

ACTTGCGTCT CGCCCTCCGG CCAAGC^gG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCA6CCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCG6ACGC 240 

75 TCATCTTCCG TQTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGC66C 300 

TCA6CCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAA6ACGAGC 360 
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GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
TCATCTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 
TTCAGTTGCA 
CCAGCATACC 
GGAT6GCATT 
GTGAAGCGTC 
AACAA6ACCA 
TGTTGGAGAC 
TCTTCCTGQA 
TCAGCACTTC 
TGCCGGAGCC 
TGGCGGTGCT 
GCTCAGGGAA 
TTAAGTCAGA 
GGGCTCCGGG 
CAGCTCCCTT 
CCTCCAAAGG 
GGCCACTGGG 
GTCCACCACC 
CCGAGCGGGT 
AAATACCTGG 
CAAAGGCTGG 
GCCTGCTCAT 
AGAAGCAGCT 
ACATTTTTTC 
GGCCAGGTGT 
TCACAAAGTC 
TACAAAAAAA 
CTGAAGCAGG 
CACTGCACTC 
ACGCGTACCT 
TCCCCGTGTT 
GGGGAGCAGA 
TTAGCACCAA 
AGAATGGTAC 
CTGTGTGTAT 
TTGTTTCCTT 
AAAGCTTAAT 
AACCTGGGGG 
AGAGATCAGG 
CTACCCTACT 
TGTTAGCAGG 



GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CT^GAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 
TGACCTGAAA 
CGGCCTGAAC 
CAAGGAGAGG 
AGGGCACCCC 
AGATCCACAG 
AGGTGTTGAA 
GCTGGTCAAT 
CACTGCCAGT 
GGAGAGCCGG 
GGGCGCTGTC 
GCAGGAGATC 
TAAGCTCCCA 
AGACCAGGGA 
CCCTGCCTGG 
GACTAGAGAG 
TTAGGACCTG 
ATCTCCTCCA 
AGGAGAGTTT 
CTCCTGCCAG 
CTTCCACCAT 
GTTGAAGTGC 
GCAGTGTTGC 
TTTGGTCAGA 
GGTGGCTCAC 
AGGACGAGAC 
AATTAGCTAG 
AGAATGGTAT 
CAGCCTG6GC 
GCGGTGAGGA 
CACTTGCTCC 
CAAAGATGAG 
ACTTCTACAA 
TTAGGGATGG 
GCATACATAT 
TATATATGTA 
TGTCCCAGAA 
CCTGTGAAAC 
GGTTACCTCT 
TTTCAGCAGC 
AGCTATGTCC 



AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAACGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGA6AAGAGA 
CGGGAGGCAG 
CGCACACAGC 
AAGGTGTGGG 
CGGCCCACCA 
CGAGTCCTGA 
TGCACGGCCT 
TTAACCACCC 
CCTCATACCA 
GGCGTGGTCA 
CTCTATTTCC 
ACGCTGCCCC 
GAAGAGATGG 
GAGAAATACA 
ACCATTCCCA 
AAGCCTCCTG 
AGGACCTCAC 
CGTTGAGTGA 
CTTGCAGAAC 
CAGCTGAGCT 
CCAGGTGCAC 
GCTGTTCACA 
TGCCACCACC 
AGCCAG6AAC 
GCCTGTAATC 
CATCCTGGCT 
GCGTAQTGGT 
GAATCCAGGA 
AACACAGCGA 
AGCTGG6CGC 
CATAGCCCTC 
GTCTACACTG 
ACCAAGCTCA 
AAAACGGGGC 
GTGTGTATAT 
TGTATATATA 
AATCATACAT 
TACAACCAAA 
GCTTCTGAGC 
AAAACGTCCC 
CTTCCTATCQ 



GGTCCCAG6A 
AGGTCAACCC 
TAG6GAG6AA 
AGGAGGAGAA 
ACACCTTGCA 
ACTGTGAGCT 
CCGTCCCTGT 
TGAAGGAAGG 
TCAGCATCAG 
ACGGGGTCCT 
CCTGGAACTT 
ATGTGTCTGA 
CCCTGACCTG 
CAGACCAGGT 
GAGGCGGCTA 
TGGTCAAGCT 
TGAAAGAGAA 
TCTCCTG6AA 
GCACCCTGAA 
CCAACGACCT 
TCACACCAGA 
GAGCCAACA6 
TCGTGGCTGT 
TCTATAAGAA 
CGTCTCGTAA 
GCCTCCTGCA 
TCGATCTGAG 
GCTCCCTGCT 
CTCCCCTCAC 
TTGGCCCTGC 
AGCTCATCCC 
GTGTTTTTTC 
GGGTAGCCTC 
CACTGAAGTG 
CCCGCTCCGG 
CTCCTGCTCG 
TGGTGTCATT 
CCAGCACTTT 
AACACGGTGA 
TGGCACCTAT 
GGTGGAGCTT 
GACTCCGTCT 
TGTTTTCGAG 
TTGATGGATC 
TCCTTCATGG 
GGGCCCCAAC 
CTGGCTAGAG 
ATGGTTTTGT 
TATATGAAAA 
TGCTTTTTTA 
AGGCACACAA 
AAATGGCTCA 
GTATGACGCA 
TTTCCGTCCA 



6TACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCCG 
GGACCXSCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 
TCGCTGCGTG 
GGCCATTTTT 
TATGGTGTTG 
CGTCAACGGC 
TGTCCTCGTG 
GGGCAAAAAC 
CTCCAACACA 
CACCTCCACA 
GATTGTGT6C 
GGGCAAGCT6 
GACCGAACTT 
6GGCA6CAGC 
GCATTAgCCC 
CACTCTTCTC 
CTGCACACCC 
AAGCCGCTTT 
AAGCAAGGAG 
TTTACACACA 
TCTGAGCTGG 
AGGACACACC 
AGAGCACCCC 
CCTCTTCAAA 
CCTTAAAAGA 
GGGAGGCCGA 
AACCCTGTCT 
AGTCCCAGCT 
GCAGTGAGCC 
C6A6GAAAAA 
TTCAGGTGAA 
AC6TAAAACT 
GGATTAAAGC 
CCTAGAAG66 
CTTCGGGTGT 
CAGGTGTGTA 
TATATATATA 
TTCTACATGG 
AACCGTTTCC 
AGCTCTACCA 
GCACGAAGGG 
CTT 



CAGCTCCGCG 

CCTGTGAACA 

ATTCCTCAAG 

CACATTCAGT 

AAGGCACA6C 

CTGCCCAGTG 

ACAGAAAAAG 

GAAATCAGGT 

CCCAGCACCA 

CCTGCCCGGA 

ATATCGCTGC 

AGTCCCGCAG 

AGTAGCCAGG 

GGGCCTGTGC 

GCGTCTGTGC 

GGCCCCCCTT 

AATCTGTCTT 

ACGGCAAGTG 

ACCCCGGAGC 

ACCAGCATCC 

ACCACTGGCC 

GAGAGAAA6C 

ATCCTGGTCC 

CCGTGCAG6C 

GTAGTTGAAG 

GGTGACAAGA 

CGAATCACTT 

TCAGCCAAAG 

CCTTTCAGAG 

TCAGGGACCA 

CCCCAGTCTC 

TTATGGCTGT 

TTTCCTGCCC 

GGAGCCAGGC 

AGCGGCATCC 

GTCTCCTGTG 

TACGTGCCGG 

GGCGGGCGGA 

CTACTAAAAA 

ACTCGGAAGG 

GAGACC6T6C 

AAAAGAAAAG 

TTAGCCTCAA 

6AAA6GCA6C 

TATGGTTATA 

CCCAAATGAG 

GTGTGTCTGT 

AATTTGCAAA 

TATGAAAAAT 

GTACCACAGG 

AGTTGGCAGC 

GAGCAGACAG 

CCTGGCAGGC 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



Seq ID NO: 20 Protein aeouence ; 
Protein Accession #: NP 006491 



60 



65 



70 
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MGLPRLVCAF 
DWPSVHKEKR 
PRSQEYRIQL 
LKEEKNRVHI 
VTVPVPYPTE 
DNGVLVLEPA 
LTLTCEAESS 
QLVKLAIFGP 
LSTLNVLVTP 
TRANSTSTER 
PPSRKTELW 



11 

I 

LLAACCCCPR 
TLIFRVRQGQ 
RVYKAPEEPN 
QSSQTVESSG 
KVWLEVEPVG 
RKKHSGRYEC 
QDIiEFQWLRE 
PWMAFKERKV 
ELLETGVECT 
KLPEPESRGV 
EVKSDKLPEE 



21 
I 

VAGVPGEAEQ 
GQSEPGBYEQ 
IQVNPLGIPV 
LYTIiQSILKA 
MliKEGDRVEI 
QAWNLDTMIS 
ETDQVIiERGP 
WVKENMVliNL 
ASNDLGKNTS 
VIVAVIVCIIi 
M6LLQGSS60 



31 

I 

PAPEIjVBVBV 
RLSliQDRGAT 
NSKEPEEVAT 
QLVKEDKDAQ 
RCLADGNPPP 
LLSEPQELLV 
VLQLHDLKRE 
SCBASGHPRP 
IliPIiELVNLT 
VIiAVLGAVLY 
KRAP60Q6EK 



41 
I 

GSTALLKCGIi 
liALTQVTPQD 
CVGRNGYPIP 
FYCELNYRLP 
HFSISKQNPS 
NYVSDVRVSP 
AGGGYRCVAS 
TISWNVNGTA 
TLTPDSNTTT 
FLYKKGKLPC 
YIDhRH 



51 
I 

SQSQGNIiSHV 
ERIFLCQGKR 
QVIWYKNGRP 
SGNHMKESRB 
TREAEEETTN 
AAPERQEGSS 
VPSIPGIjNRT 
SEQDQDPQRV 
GLSTSTASPH 
RRSGKQEITL 



60 
120 
180 
240 
300. 
360 
420 
480 
540 
600 
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Seq ID NO: 21 Nucleotide secniencei 
Nucleic Acid Accession #: NM_00242l 

Coding sequence: 72.. 1481 (underlined sequences correspond to start and stop codons) 

5 

1 11 21 31 41 SI 

I I I I i I 

GGGATATTGG AGTAGCAAGA GGCTGGGAAG CCATCACTTA CCTTGCACTG AGAAAGAAGA 60 

CAAAGGCCAG T ATGC ACAGC TTTCCTCCAC TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120 

10 CTCACAGCTT CCCAGCGACT CTAGAAACAC AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180 

ACCTGGAAAA ATACTACAAC CTGAAGAATG ATGGGAGGCA AGTTGAAAAG CGGAGAAATA 240 

GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300 

GGAAACCAGA TGCTGAAACC CTGAAGGTGA TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360 

TGGCTCAGTT TGTCCTCACT GAGGGGAACC CTCGCTGGGA GCAAACACAT CTGACCTACA 420 

15 GGATTGAAAA TTACACGCCA GATTTGCCAA GAGCAGATGT GGACCATGCC ATTGAGAAAG 480 

CCTTCCAACT CTGGAGTAAT GTCACACCTC TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540 

CAGACATCAT GATATCTTTT GTCAGGGGAG ATCATCGGGA CAACTCTCCT TTTGATGGAC 600 

CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC CAGGCCCAGG TATTGGAGGG GATGCTCATT 660 

TTGATGAAGA TGAAAG6TGG ACCAACAATT TCAGAGAGTA CAACTTACAT CGTGTT6CGG 720 

20 CTCATGAACT CGGCCATTCT CTTG6ACTCT CCCATTCTAC TGATATCGGG GCTTTGATGT 780 

ACCCTAGCTA CACCTTCAGT GGTGATGTTC A6CTAGCTCA GGATGACATT GATGGCATCC 840 

AAGCCATATA TGGACGTTCC CAAAATCCTG TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900 

CGTGTGACAG TAAGCTAACC TTTGAT6CTA TAACTACGAT TCGGGGAGAA GTGATGTTCT 960 

TTAAAGACAG ATTCTACATG CGCACAAATC CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020 

25 TTTCTGTTTT CTGGCCACAA CTGCCAAATG GGCTTGAAGC TGCTTACGAA TTTGCCGACA 1080 

GAGATGAAGT CCGGTTTTTC AAAGGGAATA AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140 

TACACGGATA CCCCAAGGAC ATCTACAGCT CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200 

TCGATGCTGC TCTTTCTGAG GAAAACACTG GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260 

ACTGGAGGTA TGATGAATAT AAACGATCTA TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

30 ATGACTTTCC TGGAATTGGC CACAAAGTTG ATGCAGTTTT CATGAAAGAT GGATTTTTCT 13 80 

ATTTCTTTCA TGGAACAAGA CAATACAAAT TTGATCCTAA T^CGAAGAGA ATTTTGACTC 1440 

TCCAGAAAGC TAATAGCTGG TTCAACTGCA GGAAAAATTG AACATTACTA ATTTGAATGG 1500 

AAAACACATG GTGTGAGTCC AAAGAAGGTG TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560 

CATTTTTAAC CTCTAGAGTC ACTGATACAC AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

35 CATATTTTTT TACTATTTAG AATGTAGCCC TTTTTGTACT GATATAATTT AGTTCCACAA 1680 

ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

GATCTTTTCC AGAGTATGCA ACTCTGACGT TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800 

ATATCCTTTC AAGACAGAAA GAGACAGGAG ACATGAGTCT TTGCCGGAGG AAAAGCAGCT 1860 

CAAGAACACA TGTGCAGTCA CTGGTGTCAC CCTGGATAG6 CAAGGGATAA CTCTTCTAAC 1920 

40 ACAAAATAAG TGTTTTATGT TTGGAATAAA GTCAACCTTG TTTCTACTGT TTT 



45 



Seq ID NO: 22 Protein sequence; 
Protein Accession #: NP 002412 



50 



55 



1 
I 

MHSFPPLLLIi 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQIiAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 
I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELiNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQXiPNGLBA 
liSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACZDS 
AYEFADRDEV 
FFVANKYWRY 
TKRIIiTIiQKA 



41 


51 




1 

YYNIiKNDGRQ 


1 

VEKRRNSGPV 


60 


VliTEGNPRWE 


QTHLTYRIEN 


120 


ISFVRGDHRD 


NSPFDGPGGN 


180 


GHSLGLSHST 


DIGALMYPSY 


240 


KIiTFDAITTI 


RGEVMFFKDR 


300 


RFFKGNKYWA 


VQGQNVLHGY 


360 


DEYKRSMDPG 


YPKMIAHDFP 


420 


NSWFNCRKN 







Seq ID NO: 23 Nucleotide seonience: 
60 Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 141-1580 (underlined sequences correspond to start and stop codons) 



65 



11 21 31 41 51 



TCTGCGTGTG CCGGGGCTAG GGGCTGGAAG TCCTGGCTCT AGTTGCACCT CGGAAGGAAA 60 

AGGCAAACAG AGGAGGGAAG GCGTCTTAGG ACTGCCTGGA TCCAGAGCAC TTTCCTCGGC 120 

CTCTACAGGC CTGTGTCGCT ATGG GTTCCC CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TCCGCGAGTT CACTCGCCAC TCCTCCGACG TGCTGGGCAA CCTCAACGAG CTGCGCCTGC 240 

70 GCGGGATCCT CACTGACGTC ACGCTGCTGG TTGGCGGGCA ACCCCTCAGA GCACACAAGG 300 

CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

GGGTGGACGT GCTCTCTCTG CCCGGGGGTC CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

ACTTCATGTA CACTTCGCGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 4 80 

CCGCCACCTA TTTGCAGATG GAGCACGTGG TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540 

75 GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCTGGAAGC AGAACCOCCA ACACCCCCAA 600 

CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 

iNSDCX;iD: <WO_ 



CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 720 

GQAAAAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 780 

GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 

CCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 

GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 960 

CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 1020 

CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA QGCTGTGGCA GGGTGCTCAT 1080 

CGGGGCTGGA CTCCTTGGTT CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 1140 

GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 

AGCCTTACCA CTGCTCAATC TGCGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAACGC 1260 

ACAGCCGCAT CCATTCGGGA GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 1320 

TACAGGTGGC ACATCTGCGG GOGCACGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 1380 

GCCCTACCTG CGGAACCCGC TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 144 0 

ACACCGGAGA GAAGCCTTAC CACTGCGACC CCT6TGGCCT GCATTTCCGG CACAAGAGTC 1500 

AACTGCX3GCT GCATCTGCGC CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 1560 

ACATTCTCQG GGGGCCCTAg CTGAGCGCAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 1620 

GAAAGCTGCA GGCCCAGGCC TTGCTTCCCT ATCAGGCTTG GGCATAGGGG TGTGC»GGC 1680 

CACTTTGGTA TCAGAAATTG CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGT6GCA 1740 

GATCCTGGCT AGATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 

GTTTCTGAGG AGAGAGCTAG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CA6TTTATCT GTAAATATAA 1920 

TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 

TTCCACAAGT GTGATTAAAA GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 2040 

CAGAGATTAC TAGCCCTTGG CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 2100 

TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CTCTGGGAAC ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 2280 

TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 234 0 

CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 

GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 

GG6TGTGGGG GATTCTGTAT CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 2580 

GCTGTATCCA TCTATAGTGG TAGAGACCCA CCA6GGCTCA AGTGGAGTCC ATCATCCTCC 2640 

CACGGGGGCC TGTTCTTAGC ACTGAGTTGA TCGCTCCAT6 GGGGAGAGAT CAGACATTCC 2 700 

TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 2820 

ACTACAGAGG CATATGGGTT TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 

TCTTCCTTTA GTGGGTTTT6 GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 2940 

TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AA6CTGGGTC TGTGACCTGG 3000 

TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAG6AA 3060 

AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 

ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAGG ATTAAGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 324 0 

GAAAGTGAGG AACAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 

GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG GGGCTTATCT GATTATGGGA 3360 

CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 

GTTCTAGT6G CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 3480 

TCTCCCCATG GCCCCACTGC AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 354 0 

AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 3600 

AGAAATGTTA GATCTTGCAA CATCA6ATCC TTGGAATAAA GAA6CCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 



Seq ID NO: 24 Protein sequence! 
Protein Accession #: FGENBSH predicted 



1 11 21 31 41 51 

I I I I I I 

MGSPAAPEGA LGYVREFTRH SSDVLGNLNE LRLRGILTDV TLLVGGQPIiR AHKAVLIACS 60 

GFFYSIFRGR AGVGVDVI^L PGGPEARGFA PLLDFMYTSR LRIiSPATAPA VIAAATYLQM 120 

EHWQACHRF IQASYEPIiGI SLRPLEAEPP TPPTAPPPGS PRRSEGHPDP PTESRSCSQG 180 

PPSPASPDPK ACNWKKYKYI VLNSQASQAG SIiVGBRSSGQ PCPQARLPSG DEASSSSSSS 24 0 

SSSSEEGPIP GPQSRLSPTA ATVQFKCGAP ASTPYLLTSQ AQDTSGSPSE RARPLPGSEP 300 

PSCQNCEAVA GCSSGLDSLV PGDEDKPYKC QLCRSSFRYK GNLASHRTVH TGEKPYHCSI 360 

CGARFNRPAN LKTHSRIHSG BKPYKCETCX3 SRPVQVAHIiR AHVLIHTGEK PYPCPTCGTR 420 
FRHIiQTLKSH VRIHTGEKPY HCDPCGIjHFR HKSQLRIiHLR QKHGAATNTK VHYHXLGGP 



Seq ID NO: 25 Nucleotide sequence! 
Nucleic Acid Accession #: U21551 

Coding sequence: 1..11S5 (xinderlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

aXSGATTQCA GTAACGGATC GGCAGAGTOT ACCGGAGAAG 6A66ATCAAA AGAGGTGGTO 60 
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10 



15 



20 



GGGACTTTTA 
GACCCCAATA 
TCCTCAGAGT 
CCTGGCTCAT 
GGAGTA6ATA 
TCTGCTGTGA 
CAGCTTGTGA 
CGTCCTGCAT 
CTCTTTGTAC 
TCCCTGTGGG 
ATGGGAGGGA 
CAGCAGGTCC 
CTTTTTCTTT 
GGCATCATTC 
GAATTTAAGG 
AACAGAGTGA 
ATACTGTACA 
AGCCX3CATCT 
ATTGTGCTAT 



AGGCTAAAGA 
ATCTGGTTTT 
TTGGATGGGA 
CAGCTTTGCA 
ATAAAATTCG 
GGGCAACTCT 
AATTGGATCA 
TCATTGGAAC 
TCTTGAGCCC 
CCAATCCCAA 
ATTACGGCTC 
TGTGGCTCTA 
ACTGGATAAA 
TTCCAGGAGT 
TGTCAGAGAG 
GAGAGATGTT 
AAGGCGAGAC 
TGAGCAAATT 
CC TGA 



CCTAATAGTC 
TGGAACTGTG 
GAAACCTCAT 
CTATGCAGTG 
ACTGTTTCAG 
GCCGGTATTT 
AGAATGGGTC 
TGAGCCTTCT 
AGTGGGACCT 
GTATGTAAGA 
ATCTCTTTTT 
TGGCAGAGAC 
TGAAGATGGA 
GACAAGGCGG 
ATACCTCACC 
TAGCTCTGGT 
AATACACATT 
AACTGATATC 



ACACCAGCTA 
TTCACGGATC 
ATCAAGCCTC 
GAATTATTTG 
CCAAACCTCA 
GACAAAGAAG 
CCATATTCAA 
CTTGGAGTCA 
TATTTTTCAA 
GCCTGGAAAG 
GCCCAATGTG 
CATCAGATCA 
GAAGAA6AAC 
TGCATTCTGG 
ATGGATGACT 
ACAGCCTGTG 
CCAACTATG6 
CAGTATGGAA 



CCATTTTAAA 
ATATGCTGAC 
TTCAGAACCT 
AAGGATTGAA 
ACATGGATAG 
AGCTCTTAGA 
CATCTGCTAG 
AGAAGCCTAC 
GTGGAACCTT 
GTGGAACTGG 
AAGACGTAGA 
CTGAAGTGG6 
TGGCAACTCC 
ACCTGGCACA 
TGACAACAGC 
TTGTTTGCCC 
AGAATGGTCC 
GAGAAGAGA6 



GGAAAAACCA 
GGTGGAGT6G 
GTCATTGCAC 
GGCATTTCGA 
AATGTATCGC 
GTGTATTCAA 
TCTGTATATT 
CAAAGCCCTG 
TAATCCAGTG 
GGACTGCAAG 
TAATGGGTGT 
AACTATGAAT 
TCCACTAGAT 
TCAGTGGGGT 
CCTGGAGGGG 
A6TTTCTGAT 
TAAGCTGGCA 
CGACTGGACA 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



25 



30 



35 



40 



Seq ID NO: 26 Protein secnaence; 
Protein Accession #: AAB08528 



MDCSNGSAEC 
SSEFGWEKPH 
SAVRATLPVF 
LFVLIiSPVGP 
QQVIiWLYGRD 
EFKVSERYIiT 
SRILSKLTDI 



11 

I 

TGEGGSKEW 
IKPIiQNLSLH 
DKEEIiliECIQ 
YFSSGTFNPV 
HQITEVGTMN 
MDDIiTTALEG 
QYGRBBSDWT 



21 

i 

GTFKAKDIjIV 
PGSSAIiHYAV 
QLVKLDQEWV 
SLWANPKYVR 
LFLYWINEDG 
NRVREMFSSG 
IVIiS 



31 

I 

TPATILKEKP 
ELFEGLKAFR 
PYSTSASLYI 
AWKGGTGDCK 
EEEIiATPPLD 
TACWCPVSD 



41 

I 

DPNNLVFGTV 
GVDNKIRLFQ 
RPAFIGTEPS 
MGGNYGSSIiF 
GIILPGVTRR 
ILYKGETIHI 



51 

I 

PTDHMLTVEW 
PNIjNMDRMYR 
LGVKKPTKAIi 
AQCEDVDNGC 
CIIiDIiAHQWG 
PTMEHGPKliA 



60 
120 
180 
240 
300 
360 



Seq ID NO: 27 Nucleotide sequence; 
Nucleic Acid Accession #: XM_039209 

Coding sequence: 656.. 2758 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



45 



50 



55 



60 



65 



70 



75 



TCGCGCGGGG 
TGACGTCAAG 
AAGAGGGAAC 
CAGCATCATC 
GTTTCTGTTC 
TGTGGCTGTC 
AGCCGCAGCA 
GCACAACTTT 
TATTTTTTGC 
TGGAGTGCCC 
CAGTGGCGTT 
GAAGATGCTC 
TGCTAAGTTT 
GAACCCCCCG 
GAGTGGGGGA 
TGACAGCCCG 
ATGTGGGAAG 
GTTCCACTCA 
AGACTATTGC 
T^CTGCGGAT 
TTTTCCAAGA 
TGACAAAGTG 
TGTGAGTGGG 
CTTCATTCTG 
GGAGCCTTAT 
AGGACTGCTA 
CTATACCACC 
GGAATACACA 
TCTTGAAGTT 
CGGCTTTTTG 
6GATG6GTTA 
CAACGTGCCT 
CGAAGTGTTT 
TGATATAAAC 



GCCGCCCCCT 
GGGCGCGCTG 
GAAACATGAG 
TAGAGCCCAG 
CTGCTACTGT 
CTCTCCTTTT 
GAAAGCCCCC 
ATCTCGCTCC 
AAAGTTGCAT 
TACAGCCCCG 
CCCCCCCATC 
TCCTTTAAGC 
GGGGAAAGAA 
AAGCGCCTGA 
GAGAT6CTGT 
GGGCTT^GGC 
TTACTGGAGG 
CCTGAGAGAG 
AAAGAATTCT 
GAGTTTTGCT 
AAACAAGTCA 
GAAGAGATCA 
CTGCGGCAGC 
GAAAAAGAAG 
TTGGACATTC 
AGCCTCGCAT 
AACCAAGAAC 
GTATCCAGAA 
GCAGAACTCC 
TACATCATTC 
AGTGATTTCA 
TATTCCATAC 
GCTCATGGGC 
ATCAATTTAA 



CCCCTTCCCT 
TGGCAGCACC 
AGGCTGTGTG 
CGCTGGCCCT 
CCCACCTAAA 
CTTCCTCCTC 
AACCCAACTG 
TCGGGCTCCC 
CGCTGTACAT 
CAAACTCCTC 
CTCCCGCGCC 
TGCTGCTGCT 
ACGAAGGGAG 
AAAGGAGAGA 
GCGGTGGCTT 
GCCT6GAGAA 
7\AATCAAAT6 
AAGTCTTGGA 
TTTACACTTG 
TTTACTATGC 
GAGGACCAGC 
GCAGAAAGCA 
CCGTTGGTGC 
GTTATGTGAA 
ACAAACTTGT 
TCCATCCCAA 
GGTGGGCTAT 
AAAATCCACA 
ACAGAAAGCA 
TTGGTGATGG 
CAGGCTCAGT 
CAAGGAGCAA 
TCCACGATCC 
CGATACTGTG 



CCACCCTGGG 
TCCCCGCGCG 
AGAAGCTGCA 
GCCTCCGCCT 
CAACTCCCGT 
TTCCAACTCC 
ACACTGGCAC 
CTAAGGCATT 
ATTTTTGTCC 
CTGGAGCTGC 
CAGCCCCTGC 
GGCCGTGGCT 
CGGAGCAAGG 
CAGGA6GATG 
CTACCCTCX3G 
TAAGATATTT 
TGCACTTTGC 
AAGAGACCTA 
CCGAGGCCAT 
AAGAAAAGAT 
ATCTAACTAC 
CAAACACAAC 
CCTGCATAGT 
GATACTTACC 
TCAAAGTGGA 
TTACAAGAAA 
CGGGCCTCAT 
CCAAGTTGAT 
TCTGG6AGGA 
GATGATTACA 
GCTACGGCT6 
CCCACACTTC 
AGGCAGATGT 
TTCAGACTCC 



CGGGGGCGCG 
CTAGTTAAAA 
GCCGCCGGCA 
GCCCCGCCGC 
TACACGGACA 
TTCTCCTCCT 
AACTGCAAAC 
GGACCCATCG 
CCGCCACCTC 
GCCCTAGTGC 
TGCTCTGGGC 
CTGGGCTTCT 
AGGAGAAGGT 
ATGTCCCAGC 
CTGTCCTGCT 
TCTGTTACCA 
TCTCCACATT 
GTACTTCCTC 
ATTCCAGGTT 
GGTGGGTTGT 
TTGGACCAGA 
TGCTTCTGTA 
GGGGATGGCT 
CCTGAAGGAG 
ATAAAGGGAG 
AATGGAAAGT 
GACCACATTC 
TTGAGAACAG 
CAACTGCTCT 
CTGGATGATA 
GAT6TGGACA 
AACAGCACCA 
GCTGT66ATA 
AATGGAAAAA 



CGAGAAGCGG 
AGAAGAAGAA 
GAGGAGACCT 
CGCCGTCGCC 
AGTGAACATC 
CCCACTTCCC 
GGTGTCATCC 
CCGCGTCTTT 
CCTCTGTCTC 
CCCTGCTGGG 
AGAC GATG CT 
TTGAAGGAGA 
GCCTGAAT6G 
TGGA6CTGCT 
GCCTGCGGAG 
ACAACACAGA 
CTCAAAGCCT 
TGCTCTGCAA 
TCCTTCAT^C 
GCTTTCCAGA 
TGGAAGAATA 
TTCAGGAGGT 
CGCAACGTCT 
AAATTTTCAA 
GAGATGAAAG 
TGTATGTGTC 
TTAGGGTTGT 
CCAGAGTCTT 
TTGGCCCTGA 
TGGAAGAAAT 
CAGACATGTG 
ACCAGCCCCC 
GACATCCCAC 
ACAGATCATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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AGCCAGAATT CTACAGATAA TAAAGGGGAA AGATTATGAA AGTGAGCCAT CACTTTTAGA 2100 

ATTCAAGCCA TTCAGTAATG GTCCTTTGGT TGGTGGATTT GTATACCGGG GCTGCCA6TC 2160 

AGAAAGATTG TATGGAAGCT ACGTGTTTGG AGATCGTAAT GGGAATTTCC TAACTCTCCA 2220 

GCAAAGTCCT GTGACAAAGC AGTGGCAAGA AAAACCACTC TGTCTCGGCA CTAGTCGGTC 2280 

CTGTA GAGG C TACTTTTCCG GTCACATCTT GGGATTTGGA GAAGATGAAC TAGGTGAAGT 2340 

TTACATTTTA TCAAGCAGTA AAAGTATGAC CCAGACTCAC AATGGAAAAC TCTACAAAAT 2400 

TGTAGATCCC AAAAGACCTT TAATGCCTGA GGAATGCAGA GCCACGGTAC AACCTGCACA 2460 

GACACTGACT TCAGAGTGCT CCAGGCTCTG TCGAAACGGC TACTGCACCC CCACGGGAAA 2520 

GTGCTGCTGC AGTCCAGGCT GGGAGGGGGA CTTCTGCAGA ACTGCAAAAT GTGAGCCAGC 2580 

ATGTCGTCAT GGAGGTGTCT GTGTTAGACC GAACAAGTGC CTCTGTAAAA AAGGATATCT 264 0 

TGGTCCTCAA TGTGAACAAG TGGACAGAAA CATCCGCAGA GTGACCAGGG CAGGTATTCT 2700 

TGATCAGATC ATTGACATGA CATCTTACTT GCTGGATCTA ACAAGTTACA TTGTATAGTT 2760 

TCTGGGACTG TTTGAATATT CTATTCCAAT GGGCATTTAT TTTTTATCCT GTCATTAAAA 2820 

AAAAAAGACT GTTATCCTGC TACACACTCC TGTGATTTCA TTCTCTTTTA TTAATTTAAA 2 880 

AATAATTTCC AGAAATGTGC AGATCCTCTG TGTGTATGTC AGCATGTTTG TTCACATATG 2940 

CACAT ACA CA TACTCATAAC CCCTATATGC QTTGTTGCAT AACAGATGAT TTTTTAAAAT 3000 

ATATACTTCC TTATGCAAA6 TAATTTACAC AGAAATTCCA TTGTAAATTG ATAATGGATT 3060 

TTTTATGTTA CTAGAAGAGA TTATTTGACT TCCCAGGAAT TTTCTGTCTG TAATCACTAA 3120 

AGTCAACTTT AATAGAGTTT TGAAACAGTA CTGTGCAATC CGATGGATCT AATTAAAAAA 3180 

AAG GCA ATAT TTTTATATTA AAGTACTATA CTAGGAGAGA ATGTTTCAGA ACTCCCTGAT 3240 

GAATTTCTAA GTGAGCAACT TGATATAAAA TTGTAATCTT CATTTTTGTC AGTGTATCCA 3300 

GTTACAGAAT GCTACACACT TACCTTTTTA TTGGCTGAGA AATCTGGTTA TTTCATCTTA 3360 

ATCTCAAGAT TGTTTTCAAG TGTTTTATAA TTAAATCATA ATAGCATATT TTAA7UVTCAA 3420 

TCTTCCTAAA AGGT CTGCT T TTATTGTATA TTTTATTTAA CAATAGQCAC TGGGTTTGTG 3480 
TTACATATTT ATATATTTTA TTTTATTTTT ATAATATAGA CATCACCTAG 

Seq ID NO: 28 Protein searuence ; 
Protein Accession #: XP 039209 
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MLKMLSFKIilj LLAVALGFFE GDAKFGERNE GSGARRRRCL NGNPPKRLKR RDRRMMSQLE 60 

IiLSGGEMLCG GFYPRLSCCL RSDSPGLGRIi ENKIFSVTNN TECGKLLEEI KCALCSPHSQ 120 

SIiFHSPEREV liERDLVIiPLL CKDYCKEPFY TCRGHIPGPL QTTADEFCFY YARKDGGLCF 180 

PDFPRKQVRG PASNYIJ3QMB EYDKVEEISR KHKHMCPCIQ EWSGLRQPV GALHSGDGSQ 240 

RliFILEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGUiSIiAFH PNYKKNGKLY 300 

VSYTTNQERW AIGPHDHILR WKYTVSRKN PHQVDLRTAR VPLEVAELHR KHLGGQIiLFG 360 

PDGFIiYIIIiG DGMITLDDME EMDGLSDFTG SVLRhDVDTD MCNVPYSIPR SNPHFNSTNQ 420 

PPEVFAHGliH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 480 

liEFKPFSNGP LVGGFVYRGC QSERLYGSYV PGDRNGNFItT IiQQSPVTKQW QEKPIjCLGTS 540 

GSCRGYFSGH ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PBBCRATVQP 600 

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCliCKKG 660 
YLGPQCEQVD RNIRRVTRAG IIiDQIIDMTS YLLDLTSYIV 



Seq ID NO: 2 9 Nucleotide sequence; 
Nucleic Acid Accession NM_024756 

Coding sequence: 75.. 2924 (underlined sequences correspond to start and stop codons) 



AAGACAACGT 
GCCCCACCAC 
GGCTGCTGGG 
GGACACCT6G 
ACTGGT6CCC 
AATTCCTCAT 
AAGTCATGTA 
CTTTGGCCTG 
TGGCAATCCC 
TCAGCTTCAA 
AGCAGGAACA 
CAGGCCTGTG 
CAGGGCACGA 
TCCTACAAGT 
CCCAGGCCAT 
TCCAGGACAG 
AGGTCCAGGA 
ACGCCCAGCA 
AATTGAAGAG 
CAACX;CCTGG 
TGCAGAGGAA 
ACACCCTG6A 
ACTCCGAATC 
T6CAGGTGAA 



11 
I 

.CACTAGCAGT 
CAAGATGATC 
GGCATGGGCC 
GGTCTGGAAG 
CTACCCAATG 
CCACTCGCAG 
CCGCATGGCC 
GAGGTGCTGC 
TGAGCCTGCA 
ACCTGGCCAC 
TCTGCTGGGA 
GAAAGCCCTG 
GTTCCCTGAT 
GCATTTCAGC 
AAGAAACCTG 
TGCCGTGGCC 
GAACACTCAG 
CTTTACCCTG 
GCTGCACAAG 
GGCTGGGGCA 
CCTCTCAOAG 
GGACATGAGG 
GGACGAGACT 
CCACACGGGG 



21 
I 

TTCTGGAGCT 
CTGAGCTT6C 
CAGGCTTCCA 
GCAGAGGCTG 
TCCAAGCTGG 
CAGCCGTGTC 
CACAAGCCAG 
CCTGGCTACA 
GATCCTGGTG 
CTTGCTGCAG 
GATCTCCAGA 
CCTGGTAACC 
AGATCCTTGG 
CCCATCTGGA 
TCTCTTGACG 
AGGGCTGACT 
AGAGTGGGTC 
CACCX3CTCGA 
GCTCAGGAGG 
AGGCCTGAGC 
CTGCACATGA 
GCCACCCTGA 
TTCGATCAGA 
CTCCGTGAGC 



31 

I 

ACTTGCCAAG 
TGTTCAGCCT 
GTACTAGCCT 
AGGACACCA6 
TCACCTTACT 
CGCA6GGAGC 
TGTACCAGGT 
CGGGCCCCAA 
ACAGCCACCA 
TGATCAATGA 
ATGATGTGCA 
TCACAGCTGC 
AGCAGGTGCT 
GGAGCTTTAA 
TGGAGGCCAA 
TCCAGGAGCT 
AGCTGCGACA 
TCTCAGAGCT 
CCCCAGGGAC 
CGGACAGCCT 
CCACGGCCCG 
CCCXS6CACGT 
TTA6CAAGGT 
TGC6CGTGAT 



41 

I 

GCTGAGTGTG 
TGGGGGCCCC 
CTCTGATCTG 
CAAG6ACCCC 
AGCTCTTTGC 
TCCAGACTCC 
CAAGCAGAAG 
CTGCGAGCAC 
GGAACCTCAG 
GGTTGAGGTG 
CCGGGTGGCA 
AGTGATGGAA 
GCTACCCCAC 
CCAAAGCCTG 
CCGCCAGGCC 
TGGTGCCAAA 
GGACGTGGAG 
CCAAGCCGAT 
CAATGGCAGT 
GCAGGCCAGG 
CAGGGAGGAG 
GGATGAGATC 
GGAGCX36CAG 
CCTGAT6GAG 



51 

I 

AGCTGAGCCT 
CTGGGCTGGG 
CAGAGCTCCA 
GTTGGACGTA 
AAAACAGAGA 
CAGAAAGTCA 
GTGCTGACCT 
CACGATTCCA 
GATGGACCAG 
CAACAGGAAC 
GACAGCCTGC 
GCAAATCAAA 
GTGGACACCT 
CACAGCCTTA 
ATCTCCAGAG 
TTTGAGGCCA 
GACCGCCTGC 
GTGGACACCA 
CTGGTGTTGG 
CTGGGCCAGC 
GAGTTGCAGT 
AAGGAACTGT 
GTGGAGGAGC 
AAGTCTCTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TCATGGAGGA GAACAAGGAG GAGGTGGAGC GGCAGCTCCT GGAGCTCTU^C CTCACGCTGC 1500 

AGCACCTGCA GGGTGGCCAT GCCGACCTCA TCAAGTACGT GAAGGACTGC AATT6CCAGA 1560 

AGCTCTATTT AGACCTGGAC GTCATCCGGG AGGGCCAGAG GGACGCCACG CGTGCCCTGG 1620 

AGGAGACCCA GGTGAGCCTG GACGAGCGGC GGCAGCTGGA CX3GCTCCTCC CTGCAGGCCC 1680 

TGCAGAACGC CGTGGACGCC GTGTCGCTGG CCGTGGACGC GCACAAAGCG GAGGGCGAGC 1740 

GGGCGCGG6C GGCCACGTCG CGQCTCCGGA GCCAAGTGCA GGCGCTGGAT GACGAGGTGG 1800 

GCGCGCTGAA 6GC6GCCGCX3 GCCGAGGCCC GCCACGAG6T GCGCCAGCTQ CACAGCGCCT 1860 

TCGCC6CCCT GCTGGAGGAC GCGCTGCGGC ACGAGGCGGT GCTGGCCGCG CTCTTCGGGG 1920 

AGGAGGTGCT GGAGGAGATG TCTGAGCAGA CGCCGGGACC GCTGCCCCTG AGCTACGAGC 1980 

AGATCCXSCGT GGCCCTGCAG GACGCCGCTA GC6GGCTGCA 6GAGCAGGCG CTCGGCTGGG 2040 

ACGAGCTGGC CGCCCGAGTG ACGGCCCTGG AGCAGGCCTC GGAGCCCCCG CGGCCGGCAG 2100 

AGCACCTGGA GCCCAGCCAC GACGCGGGCC GCGAGGAGGC CGCCACCACC GCCCTGGCCG 2160 

GGCTGGCGCG GGAGCTCCAG AGCCTGAGCA ACGACGTCAA GAATGTCGGG CGGTGCTGCG 2220 

AGGCCGAGGC CGGGGCCGGG GCCGCCTCCC TCAACGCCTC CCTTGACGGC CTCCACAACG 2280 

CACTCTTCGC CACTCAGCGC AGCTTGGAGC AGCACCAGCG GCTCTTCCAC AGCCTCTTTG 2340 

GGAACTTCCA AGGGCTCATG GAAGCCAACG TCAGCCTGGA CCTGGGGAAG CTGCAGACCA 24 00 

TGCTGAGCAG GAAAGGGAAG AAGCAGCAGA AAGACCTGGA AGCTCCCCGG AAGAGGGACA 2460 

AGAAGGAAGC GGAGCCTTTG GTGGACATAC GGGTCACAGG GCCTGTGCCA GGTGCCTTGG 2520 

GCGCGGCGCT CTGGGAGGCA GGATCCCCTG TGGCCTTCTA TGCCAGCTTT TCAGAAGGGA 2580 

CGGCTGCCCT GCAGACAGTG AAGTTCAACA CCACATACAT CAACATTGGC AGCAGCTACT 2640 

TCCCTGAACA TGGCTACTTC CGAGCCCCTG AGCGTGGTGT CTACCTGTTT GCAGTGAGCG 2700 

TTGAATTTGG CCCAGGGCCA GGCACCGGGC AGCTGGTGTT TGGAGGTCAC CATCGGACTC 2760 

CAGTCTGTAC CACTCbGCAG GGGAGTGGAA GCACAGCAAC GGTCTTTGCC ATGGCTGAGC 2820 

TGCAGAAGGG TGAGCGAGTA TGGTTTGAGT TAACCCAGGG ATCAATAACA AAQAGAAGCC 2880 

TGTCGGGCAC TGCATTTGGG GGCTTCCTGA TGTTTAAGAC CTGAACCCCA GCCCCAATCT 2940 

GATCAGACAT CATGGACTCG CCCAGCTCTC CTCGGCCTGG GGCTCTGGCC AAG6ATGGGC 3000 

TG6AGGTCAT TCAGTTGGTC TGTCTCTTCC CTGGAAACCT TCTGCAAAGA TG6TGTGGTG 3060 

TACGTGGCTT CCCTGTAACC ACATGGGGCT TGGCCATTTC TCCATGATGA GAAGGACTGG 3120 

AATGCTTCTC CGGGCAGGAC ATGGTCCTAG GAAGCCTGAA CCTTGGCTTG GCATGCCTTC 3180 

TCAGACAGCA CGGCCTGGGC TCCAACTCTT CACCACACCC TGTATTCTAC AACTTCTTTG 3240 

GTGTTTTGCT CCTCCTGTGG TTGGAAACTT CTGTACAACA CTTTAAACTT TTCTCTTGCT 3300 

TCCTCTTCTC TTCTCCCTTA TCGTATGATA GAAAGACATT CTTCCCCAGG AGGAATGTTT 3360 

AAAATGGAGG CAACATTTTG GCCAACATTG GAAAGCACTA GAGGGCAATG GGATTT^AACC 3420 

AACCTGCTTG GTCTCTATTA GTCAGTAATG AAGACGACAG CCTGGCCAAC CAAGGGAAAG 3480 

GAAATTAGTA TCTTTAGTTT CAGTCATTCC TTGTAGGATA TGGTTTAGCT GTGCCCCCAC 3540 

CTAAAATATC ATCTTGAATT GTAATCCCTA TAATCCCCAC ATCAAGGGAG AGATCAGGTG 3600 

GAGGTAATTG GATCTTGGGG GCGGTTCCCC CATGCTGTTC TTGTGATAGT TCTCACGAGA 3660 

TCTGATGATT TTATAAGTTT GATAGTTCCT CCTGTGTTCA TTCTCCTTCC TGCCACCTTG 3720 

TGAAGATGCC TTGGTTCCTC TTCACTGTCT GCCATGATTG TAAGTTTCCT GAGGCCTCCC 3780 
CAGCCATGTG GAACAGTGAG TCAATTAAAC CTCTTTCCTT TATAAATT 



Seq ID NO: 30 Protein sequence; 
Protein Accession #: NP_079032 
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MILSLIiFSLG 6PIjGWGI*LGA WAQASSTSLS DLQSSRTPGV WKAEABDTSK DPVGRNWCPY 60 

PMSKLVTLLA LCKTEKPLIH SQQPCPQGAP DCQKVKVMYR MAHKPVYQVK QKVLTSIiAWR 120 

CCPQYTGPNC EHHDSMAIPE PADPGDSHQE PQDGPVSPKP GHLAAVINEV EVQQEQQEHIj 180 

liGDIiQNDVHR VADSLPGIiWK ALPGNLTAAV MEANQTGHEF PDRSLEQVLIj PHVDTFIiQVH 240 

FSPIWRSFNQ SLHSIiTQAIR NLSUDVEANR QAISRVQDSA VARADFQEIiG AKFEAKVQEN 300 

TQRVGQIiRQD VEDRLHAQHF TLHRSISBLQ ADVDTKIJCRL HKAQEAPGTN GSLVIATPGA 360 

GARPEPDSLQ ARLGQLQRNIi SELHMTTARR EEELQYTLED MRATLTRHVD EIKELYSESD 420 

ETFDQISKVE RQVEELQVNH TAIiRELRVIL MEKSLIMEEN KEEVERQIiLE L.NI.TIiQHIiQG 4 80 

GHADLIKYVK DCNCQKLYIiD LDVIREGQRD ATRALEETQV SIiDERRQIiDG SSLQAI^QNAV 540 

DAVSLAVDAH KAEGERARAA TSRLRSQVQA LDDBVGAIiKA AAAEARHEVR QLHSAFAALIj 600 

EDALRHEAVL AALFGEEVLE EMSEQTPGPL PIiSYEQIRVA LQDAASGIiQB QALGWDEIAA 660 

RVTAIiEQASE PPRPAEHLEP SHDAGREEAA TTAIAGIARE LQSLSNDVKN VGRCCEAEAG 720 

AGAASIiNASL DGLHNALFAT QRSIiBQHQRL FHSLFGNFQG LMEANVSLDIi GKLQTMIjSRK 780 

GKKQQKDLEA PRKRDKKEAE PliVDIRVTGP VPGALQAAIiW EAGSPVAFYA SFSEGTAALQ 840 

TVKFNTTYIN IGSSYFPEHG YFRAPERGVY laPAVSVEPGP GPGTGQIiVFG GHHRTPVCTT 900 
GQGSGSTATV FAMAELQKGE RVWFELTQGS ITKRSLSGTA FGGFUIFKT 

Seq ID NO: 31 Nucleotide secniencej^ 
Nucleic Acid Accession #: AB037715 

Coding sequence: 370.. 3489 (underlined sequences correspond to start and stop codons) 
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GAACGCTCAC AGAACAGGCA GTGCAATTCC ATGTTCCTCT TAAGTATGTT AGCCCTACCG 60 

GGAGCTGAGC TGGCCAGTCT ACTTGGAGAG GAAAAGTAGA TCTGGGGAAG GTGGAAGGGT 120 

CAGTTCCTAA GTGACTTCCT CCTCGGGGAT GGTAAGGGCA TTTGCTGATC TCCAGTGACT 180 

GCCTGGTGCC TCATGGTCAG ACTCGGCTGT CTCACTCCCA GATATCTGAT TTTGCAAAAA 240 

GGGACACACC TATCTGCAGC AAAGAAGACA CTGACCAGAT TGCGAGCGGT GCTTTTGGAT 300 
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GCTCTGTAGC CACCCGGGGC CCAGGAGGAC 
TCGGAGACC^ SISGCAGTGCA GCTGGTGCCC 
GAGGGCCGCC GATGTCAAGT ACATCTTCTT 
CCCAAGCTGT TGGCCAAGGA GCTTCTTGAC 
5 AAGGAGTACT TTGGAATAGC ATTCACAGAT 
GATCGAAGAG TATTGGAACA TGACTTCCCT 
TGTGTCAGGT TCTATATAGA AAGCATTTCA 
TTCTTTCTGA ACGCGAAGTC CTGCATCTAC 
GTGTTTGAAT TAGCTTCCTA TATTTTACAG 
10 GTTGTGAGGA GTGACTTGAA GAAGCTGCCA 
CCTTCCCTGG CCTACTGTGA AGACAGAGTC 
ACAAGAGGTC AAGCAATCGT AAACTACATG 
GTTCACTATT ATGCAGTGAA 6GACAAGCAG 
AAAGGGATCT TCCAGTATGA CTACCATGAT 
15 AGACAGTTGG AAAACCTGTA CTTCAGAGAA 
CGCAGG6CTT CAGTGACAAG GAGGACGTTT 
TATGCATGTC CGGCATTGAT CAAGTCCATC 
TATCTGGACA GAAAGCAGA6 TAAGTCCAAA 
GCCATCGACC TGACCX3AGAC GGGGACGCTG 

20 AAGGGGAAGA TCATCAGCGG CAGCAGCGGC 
GATAGCTCGC AGTCGGCCAA GAAGGACATG 
CTGGAGGAAA CCCTGCGTCA GAGGCTGGAG 
GAGCTCACGG GCAAGCTGCC AGTAGAATAT 
GTTCGGAGAA GAATAGGAAC AGCCTTCAAA 

25 GAGGAAGCTG AGCTGGAACG CCTGGAACGA 
GCCGCCCX3CC GCCTAGCCAG TGACCCCAAC 
ACCTCGTATC TGAATGCACT GAAGAAACTG 
CGCATCAAGT CTGGGAAGAA ACCCACCCAG 
ATTGCCAGTG AAGACAGCTC CCTCTCAGAT 

30 GTTACCAGCA CAATATCCCC CCTACATTCT 
TCGCACAACA GGCCTCCTCC TCCCCAGTCC 
CGCAACGACT ATGACAAGTC ACCCATCAAG 
GAACCCTATG AGAAGGTCAA GAAGC6CTCC 
TTCCCCAGCA GAGGAAGCTG TGCGGAAGCC 

35 CCCATCCGCG GCCTCCCGCA CTGGAACTCC 
CX3GGTCCGGA GTCCCCACTA CGTCCATTCC 
CTGCACAGCC TCGCACTGCA CTTTAGGCAC 
CTCCTGGGCT CGGAAAACGA CACCGGGAGC 
AGCAACGGCT CAGACCCCAT GGACGACTGC 

40 CACTACTACC CGGCGCAGAT GAACGCCAAC 
AAGGCGCGCC AGAGGCAGAG GCAGCGGCAG 
TCGGGCAGCA TGCCCAACCT GGCGGCGCGC 
GGCGGTGTGT ACCTGCACAG CCAGAGCCAG 
CCGCTGTACA TCGAGGGCGG CGCCACGCCC 

45 GAGTGCCACT ACAGCGTCAA GGCTCAGTTC 
CTGTTCAAGG AGAGCTGGCG CGGCGGCGGC 
CCGTCGCGAT CGCAGATCCT GCGGACTCCG 
GGCGCGGGCC GTGCCGCCGT CTCAGACGAG 
TCGCACAAGG AGCACAGCCG CCTGTCGCAC 

50 CAGTACAGCA CCTCCTCCCA GAGCACCTTC 
CAGATGTGCA AGGCCACGTC AGCTGCCTTA 
AGTGAAATTG GAGCCACCCC CCCAAGCAGC 
GAAGCAACAG AAAACTCACC CATTCT6GAT 
GATGAATAGA GGAGCTACAA TGATAGCTGT 

55 GCTGATGTCC AGTGGTACGG 6CAGGAAAAA 
CCGGCCTAAT CTGACCGCCT CAACGCCATT 
TTACCCAGAC GCACCGTCAC CCTGCACCA6 
CTCCGCATTC CCTCCCCCTT GAAAACCTGA 
CACTGTGTGT CCCCTGGCGC TCTTGCCCAT 

60 CTTGGTGGCT TCCCTCTGCC ATGACAGCCC 
GGCATCCAAT TCCTGCGGAT AAGTAGCGTT 
CAGGGTGACC CAGAAAGACG ATTCAGCTGT 
CAAGCACTTC ATGAAGAGGA GGCCTCGTGG 
GATGGGACAG CTTGTGGGGA TGGCTATGGG 

65 ACACCAGAAA TGCATCGGAG GACCACAATC 
TAAAAACATA AAAAATTAAG AGGGGCCAAG 
TTTTAAATTC TGAACTGCTA CTACACACAA 
CTCTCTCTAG CCCTCTCCCT TACTGGCCCA 
GCCCCAATGC CACGGTAAAG GCGAGGAAGT 

70 ATCCATCTGG ACACAAAGAG AGACCTGTGG 
CAT6CAGG6G GTTCAGCCGA GCCCAAGACT 
AACGTAAGGT GATAATGGCC AAAAGTGGTT 
ATCCTATTTT TTTGCATAAG GTGTTTCATT 
ACATTGCGAT CCATTCAGT6 TTTAACTGTC 

75 GTGACAAAAG AGCTCaOATC CGACTTCTCC 
TGCCCTTAGG TAGAAAGATT T6ACTCGTGT 



TGACTCGGCA GCAGGATTCX3 TGCATGGGAA 360 

GACTCAGCTC TCGGCCTGCT GATGATGACG 420 

GATGACAGGA AGCTGGMCT CCTAGTACAG 480 

CTTGTGGCTT CTCACTTCAA TCTGAAGGAA 540 

GAAACGGGAC ACTTAAACTG GCTTCAGCTA 600 

AAAAAGTCAG GACCCGTGGT TTTATACTTT 660 

TACCTGAAGG ATAATGCTAC CATTGAGCTT 720 

AAGGAGCTTA TTGACGTTGA CAGCGAAGTG 780 

GAGGCAAAGG GAGATTTTTC TAGCAATGAA 84 0 

GCCCTTCCCA CCCAAGCCCT GAAGGAGCAC 900 

ATTGAGCACT ACAAGAAACT GAACGGTCAG 960 

AGCATCGTGG AGTCTCTCCC AACCTACGGG 1020 

GGCATACCAT GGTGGCTGGG CCTGAGCTAC 1080 

AAAGTGAAGC CAAGAAAGAT ATTCCAATGG 114 0 

AAGAAGTTTT CCGTGGAAGT TCATGACCCA 1200 

GGGCACAGCG GCATTGCAGT GCACACGT6G 1260 

TGGGCTATG6 CCATAAGCCA ACACCAGTTC 1320 

ATCCATGCAG CACGCAGCCT GA6TGA6ATC 1380 

AAGACCTCGA AGCTGGCCAA CATGGGTA6C 1440 

AGCCTGCTGT CTTCAGGTTC TCAGGAATCA 1500 

CTGGCTGCCT TGAAGTCCAG GCAGGAA6CT 1560 

GAACTGAAGA AGCTGTQTCT CCGAGAAGCT 1620 

CCCCTGGATC CAGGGGAGGA ACCACCCATT 1680 

CTGGATGAAC AGAAAATCCT GCCCAAAGGA 174 0 

GAGTTTGCCA TTCAGTCCCA GATTACGGAG 1800 

GTCAGCAAAA AACTGAAGAA ACAAAGGAAA 1860 

CAGGAGATTG AAAATGCAAT CAATGAGAAC 1920 

AGGGCTTCGC TGATCATAGA CGATGGAAAC 1980 

GCCCTTGTTC TTGAGGATGA AGACTCTCAG 204 0 

CCTCACAAGG GACTCCCTCC TCGGCCACCG 2100 

CTGGAGGGAC TCCGACAGAT GCACTATCAC 2160 

CCCAAAATGT GGAGTGAGTC CTCTTTAGAT 222 0 

TCTCACAGCC ATTCCAGCAG CCACAAGCGC 2280 

GGCGGAGGAA GCAACTCCTT GCAGAACAGC 234 0 

CAGTCCAGCA TGCCGTCCAC GCCAGACCTG 2400 

ACGAGGTCGG TG6ACATCAG CCCCACCCGA 2460 

CGGAGCTCCA GCCTGGAGTC CCAGGGCAAG 2520 

CCCGACTTCT ACACCCCGCG GACTOGTAGC 2580 

TCGTCGTGCA CCAGCCACTC GAGCTCGGAG 2640 

TACTCCACGC TGGCCGAGGA CTCGCCGTCC 2700 

CGGGCGGCGG GCGCACTGGG CTCAGCCAGC 2760 

GGGGGTGCGG GGGGC6CGGG 6G6CGC6GGG 2820 

CCCAGCTCGC AGTACCGCAT CAAGGAGTAC 2 880 

GTGGTGGTGC GCAGCCTGGA GAGCGACCAG 294 0 

AAGACGTCCA ACTCCTACAC GGCGGGCGGC 3000 

GGCGACGAGG GCGACACGGG CCX3CCTGACG 3060 

TCGCTGGGCC GCGAGGGCGC CCACGACAAG 3120 

CTGCGCCAGT GGTACCAGCG TTCCACCGCC 3180 

ACCAGCTCCA CCTCCTCGGA CAGCGGCTCG 324 0 

GTGGCGCACA GCAGGGTCAC CAGGATGCCC 3300 

CCTCAAAGCC AGAGAAGCTC GACACCGTCA 3360 

CCCCACCACA TCCTAACCTG GCAGACTGGA 3420 

GG6TCTGAGT CTCCACCTCA CCAAAGTACT 3480 

TTCCTGGATT CCTCCCTCTA TCCAGAACTA 3540 

GCCAAGCCC6 GGACCCTOGT GTGAGCCAGC 3600 

CTGAGATCAC CTCACTGCCT CTCATTTGCC 3660 

CTTTGGCCCT CAGCACTTTT TTTCTCCTGT 3720 

CTGAGGAGAC ATTCTGGAAG GTTCCX3GTCC 3780 

AGA6AGCCAG ACACCAATCC TCAATGGCAC 3840 

CTAGGCCAGG AACCATCAGG GGGGCCAGCC 3900 

GGGAGAGAAC GGGAAAGGGG ACTTGGGTTA 3960 

GTCCAGCCTG CCACCCATAC GTAGGCCAAC 4020 

CATATTCAGT TTACACCTGA AATATTCCTT 4 080 

GGAAGGGGAG GTTGAGAAAG GAAGTTCTCG 4140 

AGTTCTATGC TGCCAAAGAT TAAAAATAAA 4200 

AGGAAGACAT TCTTTCTGCA AGGAAATTTC 4 260 

GTGAAAGTCA ACCCTATGTA AACTGGTGTC 4320 

CTTCTCTCTC CGTAGAGAGC CTGAAAAACT 4 380 

CTTGGCTGGC GTTGCTGACT CACAGTCGCC 444 0 

GAGTCATAGA GGGTACTGTT AGCCCCGQTC 4500 

CAAAGCTGCT TTCCTTTCAG GATTTGTAGT 4560 

CTCTCTCATT AAACCAACCA GTAAAAGCGT 4620 

TTCGTTTTTA T6G6AAACCA AGGGAAAAGC 4680 

GTGGCTCATT TTCTGTTCGT TAGCACTTGT 474 0 

TATGTGTCAC TTATTCCAAG AACCCAACTA 4800 

GTCTACTA6C CAACAGGCAG AGCAGGGTTG 4860 
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AAAAAAATAT CAGCTCCCAA AGGGCCCAT6 TGTCTACATC ATCAGTTACT GTCATGCACC 4920 

ACATTTGTGT GCMATACCA AAAGAGGAGG AAAGAAGAAA AAAATTAATG TGTGGGAGCT 4980 

GCAC6TTTAC ATGTTTTGAG CTATGCTTCA AACACAACTG GAAAGCCATC AATCTTCAAA 5040 

GGCCTCAAAA ATACTTTTAT AGTAACAAGT GCACGACTTT AGTTGGGTTA TTCAAGATGG 5100 

5 CACAAAAAGG TTTCCGCAGA GGTGGTATGC TGTGCTTTTG GCGCAAGTGG TGGGGGGATG 5160 

GGGGTGGGGG TGGAATTTTT TTCTCACTCT AATGACTTCC TATTGGAAA6 GCATTGACAG 5220 

CCAGGGACAG GAGCCAGGGT GGGGGTAGTT TTGTGGGAAA GCAGAACTGA AGTTAGCTTA 5280 

AGCATAAAAA CAAAGAAAAA TCTTCGCTTT TCATGTATGT GGAATCCAAG AATAACCATA 5340 

GGCTCTACCA GACCAGGAGG GTAAGGATGG ACACTAAAAT GAAACAAATA CCAAGGTATT 5400 

10 CCTTCTGCTG CAGCCTGGAG ACCACCGAGA GTCGAGCTGG GGCACACACA CACCTGGCCG 5460 

GGACCCGGCA GGGACAAGGC GGGCCGTGGC CTCCTCCACC AAGTCTCTCT AGACAATTCA 5520 

GGGCCTGCTT TCCCCAGCTC CATGCATGGC TGGACTGGTG ATTCCAGGGT GCAGAAGGGA 5580 

TTCATATTCC CAGAACGCTT TAAGTGTACA CCTGCAGGAT AAAGAGATAC CGGTTACATT 564 0 

ATTAAATGAT TCTAGGGATT CACTGGGGGA TATTTTTGTT GCTTTTACTT TCATGGTTAG 5700 

15 AGCTACAAAG AACAGTGATT tTTTTTTTTT CTCCCTTCCC CATTCAGAAA CATTATACAT 5760 

TGGGCCATTT TTCTTTCTCC CAAAGAAGAT TCATGGATAG TCAGACTGAA CTGTGTGCAA 5820 

CAGGAAAAGT CAAAAGGGAA AAGGCAGCTG ATGAGGTTAC ATGGTTACAT GTTCTACATC 5880 

ATGCAGAGTA GCTTGAAATC TAGTCTGGAG AAAACTGGAT CAAG ATTC TA GCCCACTGGA 5940 

GTTGC7VAGGA ATGAGAGGCA AAAATTCTAA AGATTTGGGT TATATTTTCA ACTTGGGGGA 6000 

20 CAGAGAGAAA TGGAGAGCAG GAATTACAGT TCCAACAAAC ATCATGATAG TCTGGTAGTC 6060 

AAGACAGAGA TTAAGTAAAA CAGGTTTTAC TGTTTAGCTG AGTTCAGTTA ATACAAAATG 6120 

TACATAAAAC GTTAGTCCTT TGAGACTGAC AT6ATTAATG ATCAG TGTGG TGGGAAATGA 6180 

TGTAGTTATT GTACACAAGC ACTTGCAAAC TCTTTATCCC TATTTCTTTA AAACAAAATA 6240 

AGGTGAAATA CGAAGTCCTT GGTCTGATAT AAAGCCCCTA TTGGATTCTT CGGATGCGTA 6300 

25 AAAGAAATTG CCTGTTTCAG CCAGAAGACT GGTGAAAACA CATACATCAG ACTATGTTGT 6360 

GAGCCAGGTT GATTTTTTAT TTTATTATAT GCAGGTGAGT GTTGAAACTG TTAAAATTCC 6420 

AATTTGTTTT CATTCAGTAT TAGTTTAGTT CTAAATATAG CAAACCCCAT CCAGGTGCTA 6480 

TCAGATGACC AGTTACTGCT TAGTTAACTA GGTGTAAAGT TTTACATATA CATTAATTTC 6540 

AATAGTTTAT TACAAGTTGT GTAAAATGGA CTCTAGTTTA ATAATGGGGG AAAAAAGATT 6600 

30 AGGTTGCTCC TGAAACTGAC TGTAGAGCAT GTAAAATGAT TTTACTGGAT TCTGTTCAAC 6660 

TGTAATCAAT GAAAAAGATG TACGTTGTAG ACAAAGTTGC AGAATTAAAA AAAGAAATCT 6720 

GCTTTTAATT TATTCTTTTT GTATTAAGAA TTTGTATAGT ATCTTTACAT TTTGCAAAAC 6780 
AGTGTTGTCA ACACTTATTA AAGCATTTTC AAAATG 

35 

Seq ID NO: 32 Protein sequence; 
Protein Accession #: BAA92532 



40 1 11 21 31 41 51 

I I I i I > 

MAVQIiVPDSA LGLUIMTBGR RCQVHLIJJDR iCLEIiIiVQPKI* LAKELLDLVA SHFNLKBKBY 60 

FGXAFTDBTG HIiNWLQLDRR VIiEHDFPKKS GPWLYFCVR FYIBSISYIiK DNATIELFFL 120 

NAKSCIYKEL IDVDSEWFB IiASYILQEAK GDFSSNEWR SDLKKLPALP TQALKEHPSL 180 

45 AYCEDRVIEH YKKUMGOTRG QAIVNYMSIV ESIiPTYGVHY YAVKDKQGIP WWLGLSYKGI 240 

FQYDYHDKVK PRKIFQWRQL ENLYFREKKF SVEVHDPRRA SVTRRTFGHS GIAVHTWYAC 300 

PALIKSIWAM AISQHQFYLD RKQSKSKIHA ARSLSEIAID LTETGTLKTS KLANMGSKGK 360 

IISGSSGSLL SSGSQESDSS QSAKKDMLAA LKSRQEALEE TLRQRLEELK KLCLREAELT 420 

GKIiPVEYPIiD PGEEPPIVRR RIGTAFKLDE QKILPKGEEA ELERLBREFA IQSQITEAAR 4 80 

50 RliASDPNVSK KliKKQRKTSY liNALKKLQEI ENAINENRIK SGKKPTQRAS LIIDDGNIAS 540 

EDSSLSDALV LEDEDSQVTS TISPLHSPHK GLPPRPPSHN RPPPPQSLEG LRQMHYHRND 600 

YDKSPIKPKM WSESSIJ3EPY EKVKKRSSHS HSSSHKRFPS TGSCABAGGG SNSIiQNSPIR 660 

GLPHWNSQSS MPSTPDLRVR SPHYVHSTRS VDISPTRLHS LAIiHFRHRSS SLESQGKI»U3 720 

SENDTGSPDP YTPRTRSSNG SDPMDDCSSC TSHSSSEHYY PAQMNANYST LAEDSPSKAR 780 

55 QRQRQRQRAA GAIiGSASSGS MPNIiAARGGA GGAGGAGGGV YLHSQSQPSS QYRIKBYPIiY 840 

lEGGATPVW RSLESDQECH YSVKAQFKTS NSYTAGGLFK ESWRGGGGDE GDTGRLTPSR 900 

SQIIiRTPSLG REGAHDKGAG RAAVSDELRQ WYQRSTASHK BHSRIiSHTSS TSSDS6SQYS 960 

TSSQSTFVAH SRVTRMPQMC KATSAAIiPQS QRSSTPSSEI GATPPSSPHH II.TWQTGEAT 1020 
ENSPXItDGSE SPPHQSTDE 

60 

Seq ID NO: 33 Nucleotide sequence: 
Nucleic Acid Accession #; NM_014331 

Coding sequence: 1..1506 (underlined sequences correspond to start and stop codons) 

65 

*1 11 21 31 41 51 

I I I I I I 

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

70 AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

75 TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACG CCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 
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CCTGAACTTG CGATCAAGCT CATTACAGCT 
AGCATGAGTG TCAGCTGGAG CGCCCGGATC 
GCAATTCTGA TAATTATAGT CCCTGGAGTT 
TTTAAAGACG CGTTTTCAGG AAGAGATTCA 
5 TATGGAATGT ATGCATATGC TGGCTGGTTT 
AACCCTGAAA AAACCATTCC CCTTGCAATA 
TATGTGCTGA CAAATGTGGC CTACTTTACG 
AATGCAGTGG CAGTGACCTT TTCTGAGCGG 
ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC 

10 TTATTCTATG TTGCGTCTCG AGAGGGTCAC 
CGCAAGCACA CTCCTCTACC AGCTGTTATT 
TTCTCTGGAG ACCTCGACAG TCTTTTGAAT 
GGGCTGGCAG TTCCTGGGCT GATTTATCTT 
TTCAAGGTGC CACTGTTCAT CCCAGCTTTG 

15 CTTTCCCTCT ATTCGGACCC ATTTAGTACA 
GTCCCTGCGT ATTATCTCTT TATTATATGG 
TCAGAGAAAA TAACCAGAAC ATTACAAATA 
TT ATGAA CTA ATGGACTTGA GATCTTGGCA 
TTTTTACTTC ATTTTCTGAA AGTCTAGAGA 

20 CAGTTATTTT TATTCATATA TTTTAGCATA 
AACTCTATGT AGTTATAGAA AGTGAATATG 
GTCTCTGATA CCTACCTATT GGGGTTAGGA 
TCTCTACAAC ATATGTTAGC ACGGCAAAGA 
TATATATGGG TTTTGTAAAG ATGGTTTTAC 

25 TTTTCAATTC TGAAAAAAAG CATACATCAT 
ATTTTACATT GACATTGCAT TGCTTCCCCT 
GCTTTAATGG ATTATACCCA GAGCACTTTG 
TTAAAGAAGA GTTTCTAGGG GCTACTGTTT 
AAAAATCCTT GAGAATTTAT TATGTCAGAT 

30 TTATCTGTCA TTTTTTTTTT TCACATCAGT 
AGCAAGAGTT AGTTTGGTAT TAAATCCTCA 
TACCCCTGAT GAGTCTATCT AAACATATGC 
TGAGAGAAAT AACCAACAAA GAAGAT6TTC 
CTACATGCAA TGTTAGTAAT TCTGAAGTTT 

35 TGAATTTTGA CAGTTTGTGC ATTTTCTTTA 
CTTCAGATGA AACTGTCCAG ATTAATTAGG 
AAGAAATGTC GCTGTAAATA AGATTTACAA 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT 
CTGATAAGAA GAAAATTGAA ATGAGAATCT 

40 GTTTTGCCAG TATTAGAAAA TACTGTGAGC 
GCACTTTGGG AGGCTGAGGG GGTGGATCAC 
CAACATGGAG AAACCCCATC TCTACTAAAA 
GCTGGTAATC TCAGCTATTG AGGAGGCTGA 
GAGGTTGCAG TGAGCCAAGA TTGCACCACT 

45 CCATCTCCAA AAAAAAAAAA AAAA 



GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

TGTATATCCA TGGCCATTGT CACCATTGGC 84 0 

ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

CTTCCAGAAA TCCTCTCCAT 6ATTCATGTC 1080 

GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

GATTATGGCA AAGAGGAGAG AAAGAAATTT 198 0 

TAGATACCAA TTTAGATAAC AAACACTCAT 204 0 

AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

AAAATAATA6 TCCCATATCT GTAATCATAT 2460 

TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TACATTTTAT ATTCTTCTQT TAAAATATCT 2580 

AAAAGGCATA TATTAACATA AAAATT6CAA 2640 

CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

ATACAAT^TT AGCTGGGCAT GGTGGCACAT 3000 

GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GTACTCCAGC CTGGGTGACA AAGTCA6ACT 3120 



Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP 055146 

50 

1 11 21 31 41 51 

I i I I i I 

.MVRKPWSTI SKGGYIiQGNV NGRIiPSLGNK EPPGQEKVQL KRKVTLIiRGV SIIIGTIIGA 60 

55 GIFISPKGVX, QNTGSVGMSL TIWTVCGVLS LFGAIiSYAEL GTTIKKSGGH YTYILEVPGP 120 

LPAPVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVliN 180 

SMSVSWSARI QIFLTFCKI.T AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 24 0 

YGMYAYAGWF YLNFVTEEVE NPEKTIPIiAI CISMAIVTIG YVLTNVAYFT TINAEEULliS 300 

NAVAVTFSER LLGNFSIiAVP IFVAIiSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

60 RKHTPLPAVI VLHPLTMIMIi FSGDIjDSIjLN FliSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAli FSFTCLFMVA LSLYSDPPST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEWPEEDK L 



65 Seq ID NO: 35 Nucleotide secmence; 

Nucleic Acid Accession #: NM_002422 

Coding sequence: 64.. 1497 (underlined sequences correspond to start and stop codons) 

70 1 11 21 31 41 51 

I I I I I I 

ACAAGGA66C AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTQG CAGTTTGCTC AGCCTATCCA 120 

TTG6ATGGAG CTGCAA6GGG TGAGGACACC AGCATGAACC TT6TTCAGAA ATATCTAGAA 180 

75 AACTACTAC6 ACCTCAAAAA AGATGT6AAA CAGTTTGTTA GGA6AAA66A CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAG6TGAC GGGGAAGCTG 300 
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GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

5 ATGATCTCTT TTGCAGTTAG AGTVACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

10 CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGA6ACCC CCCTGGTACC CACGG AACCT 900 

GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCC6 CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

15 TTCTGG6CCA TCAGA6GAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 138 0 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 144 0 

20 CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGT TGAA AG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 168 0 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

25 AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 
AGACGATTTG TCAGTTGTTT T 
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Seq ID NO: 36 Protein sequence; 
Protein Accession #: NP 002413 



35 



40 



MKSLPILLIili 
VKKIREMQKF 
YTPDIiPKDAV 
LAHAYAPGPG 
HSIiTDIjTRFR 
TL.RGEILIFK 
WAXRGNEVRA 
6FPKQ1AEDF 



11 

I 

CVAVCSAYPIi 
liGLEVTGKLD 
DSAVEKALKV 
INGDAHFDDD 
LSQDDINGIQ 
DRHFWRKSLR 
GYPRGIHTLG 
PGIDSKIDAV 



21 
I 

DGAARGBDTS 
SDTLEVMRKP 
WEEVTPLTPS 
EQWTKDTTGT 
SLYGPPPDSP 
KLEPBIiHLIS 
FPPTVRKIDA 
FEEFGPFYFF 



31 
I 

MNLVQKYLEN 
RCGVPDVGHF 
RIiYEGEADIM 
NIiPLVAAHEI 
ETPLVPTEPV 
SFWPSLPSGV 
AISDKEKNKT 
TGSSQIiEFDP 



41 
I 

YVDLKKDVKQ 
RTFPGIPKWR 
ISFAVREHGD 
GHSIiGIiFHSA 
PPEPGTPANC 
DAAYEVTSKD 
YFFVEDKYWR 
NAKKVTHTLK 



51 
I 

FVRRKDS6PV 
KTHLTYRIVN 
FYPFDGPGNV 
NTEAIiMYPIiY 
DPALSFDAVS 
LVFIFKGNQF 
FDEKRNSMEP 
SNSWLNC 



60 
120 
180 
240 
300 
360 
420 



45 Seq ID NO: 37 Nucleotide sequence: 
Nucleic Acid Accession #: NM^003246 

Coding sequence: 112., 3624 {\anderlined sequences correspond to start and stop codons) 



50 



11 



21 



31 



41 



51 



55 



60 



65 



70 



75 



GGACGCACAG 
CGCCGCGCTC 
GCCTGGGGAC 
TCTGGCGGAG 
TCTGGGCGCC 
GCCAACCTGA 
GCAGAAAAGG 
CTGCTGGCCC 
AAGGCGGGCA 
GAAGAAGCTC 
AGGGCCCAGC 
CAAAGCGTCT 
GTCAATGACA 
GT^GACATCC 
AACAACGTGG 
AAGGACTTGC 
CTCAGGGGCC 
GA6AACAAAG 
CA6TACAGAA 
TCAGTTACCA 
CCTGATGGAG 
CCATGGTCCX5 
CGCTCCTGCG 
CACATTCA6G 
TGGTCATCTT 



GCATTCCCCG 
CGGTACACAC 
TAGGCGTCCT 
ACAACAGCGT 
GACTGGTGAA 
TCCCCCCTGT 
GTTTCCTCCT 
TGGAGCGGAA 
CCCTGGACCT 
TCCTGGCAAC 
TGTACATCGA 
TCACCAGAGA 
ATTTCCAGGG 
TCAGGAACAA 
TGAATGGTTC 
AAGCCATCTG 
TGCGCACCAT 
AGTTGGCCAA 
ATAACGAGGA 
TCTGCAAAAA 
AATGCTGTCC 
AGTGGACCTC 
ATAGCCTCAA 
AGTGTGACAA 
GTTCTGTGAC 



CGCCCCTCCA 
AGGATCCCTG 
GTTCCT6ATG 
GTTTGACATC 
GGGCCCCGAC 
GCCTGATGAC 
TCTGGCATCC 
AGACCACTCT 
CAGCCTGACC 
CGGCCAGTGG 
CTGTGAAAAG 
CCTGGCCAGC 
GGTGCTGCAG 
AGGCTGCTCC 
CAGCCCTGCC 
CGGCATCTCC 
TGTGACCACG 
TGAGCTGAGG 
ATGGACTGTT 
GGTGTCCTGC 
TOGCTGTTGG 
CTGTTCTACG 
CAACCGATGT 
AAGATTTAAA 
ATGTGGTGAT 



GCCCTCGCCG 
CTGGGCACCA 
CATGTGTGTG 
TTTGAACTCA 
CCTTCCAGCC 
AAGTTCCAAG 
CTGAGGCAGA 
GGCCAGGTCT 
GTCCAAGGAA 
AAGAGCATCA 
ATGGAGAATG 
ATCGCCAGAC 
AATGTGAGGT 
AGCTCTACCA 
ATCCGCACTA 
TGTGATGAGC 
CTGCA6GACA 
CGGCCTCCCC 
GATA6CTGCA 
CCCATCATGC 
CCCAGCGACT 
AGCTGT6GCA 
6AGGGCTCCT 
CAGGATGGTG 
GGT6TGATCA 



CCCTCGCCAC 
ACA6CTCCAC 
GCACCAACC6 
CCGGGGCCGC 
CAGCTTTCCG 
ACCTGGTGGA 
TGAT^GAAGAC 
TCAGCGTGGT 
AGCAGCACGT 
CCCTGTTTGT 
CTGAGTTGQA 
TCCGCATCGC 
TTGTCTTTGG 
GTGTCCTCCT 
ACTACATTGG 
TGTCCAGCAT 
GCATCCGCAA 
TATGCTATCA 
CTGAGTGTCA 
CCTGCTCCAA 
CTGCGGACGA 
ATGGAATTCA 
CGGTCCAGAC 
GCTGGAGCCA 
CAAGGATCCG 



CGCTCCCGGC 
CATGGGGCTG 
CATTCCAGAG 
CCGCAAGGGG 
CATCGAGGAT 
TGCTGTGCGG 
CCGGGGCACG 
GTCCAATGGC 
GGTGTCTGTG 
GCAGGAAGAC 
CGTCCCCATC 
AAAGGGGGGC 
AACCACACCA 
CACCCTTGAC 
CCACAAGACA 
GGTCCTGGAA 
AGTGACTGAA 
CAACG6AGTT 
CTGTCA6AAC 
TGCCACAGTT 
T6GCTGGTCT 
GCAGC6CGGC 
AC66ACCTGC 
CTGGTCCCCG 
GCTCT6CAAC 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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10 



15 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



TCTCCCAGCC 
TGCAAGAAAG 
TGTTCTGTCA 
CCCCAGTTTG 
CAGGACTGTC 
AGCTACCCT6 
ATCCAGTGCA 
GGAGAGCACC 
TTCACCGGCT 
TGCAAGCCCC 
AACTACCTGG 
GGCAATGGCA 
GTGTGCGTGG 
TCAGGGCAGG 
AATGATAAAA 
GACTATGACA 
GATCAGGCAG 
GACGGTATCC 
ACTGATATGG 
CAGCTGGACT 
GAAGATGGCC 
GACCATGACA 
CCTGATGACA 
GATGGTCGAG 
GACATCTGTC 
CCTCTGGACC 
AAAGAACTCG 
AATGCTGTGQ 
GGATTTGTCT 
ACCCAGTCCT 
AAAGTTGTAA 
GGAAACACCC 
GATTTCACCG 
GTGATGTATG 
GCTGGTGGTA 
AAATACGAAT 
AATGCTGGTA 
CTTGGCTTCC 
TCAAGAAAAT 
ATCTTCCAAG 
AGTTGGGAAG 
CATCTCTGAG 
AGAATTAGCA 
AAGCACTAAG 
TTACATGTTC 
TGCTGACTGG 
GGAGACAAAG 
AGTGGCCAGA 
CTGGTCACAT 
AATAGGAAAA 
CTTATATTTT 
TTCTCTTTTT 
TTTTAAAGTC 
AGGATTATTC 
CTTCGTGACT 
TAGGCTTCAT 
CAAGAACAGC 
ATGTTATGGG 
ATTTGCAAAT 
TCCCTTGTGC 
AAAAGAGAAA 
TGACTGAGTA 
GCTACTGTAG 
TTCCCAATAA 
TCTATCATCT 
TAGAATATTC 
AGAGTTAGTA 
TTTCTTTTTT 
CTATTTGCCA 
TACATTCTAA 
AATCATATGG 



CCCAGATGAA 
ACGCCTGCCC 
CCTGTGGAGG 
GAGGCAAGGA 
CAATTGATGG 
ATGGCAGCTG 
CAGATGTTGA 
GGTGTGAGAA 
CACAGCCCTT 
GTAACCCCTG 
GCCACTATAG 
TCATCTGCGG 
CCAATGC6AC 
AAGACTATGA 
TTCCAGATGA 
GAGATGATGT 
ACACAGACAA 
TCAATGAACG 
ATGGGGTTGG 
CTGACTCAGA 
ACCAGAACAA 
AAGATGGCAA 
AGGACAACTG 
GTGATGCCTG 
CTGAGAATGT 
CCAAAGGGAC 
TCCAGACTGT 
ACTTCAGTGG 
TTGGCTACCA 
ACTGGGACAC 
ACTCCACCAC 
CTGGCCA6GT 
CCTACA6ATG 
AAGGGAAGAA 
GACTAGGGTT 
GTAGAGATCC 
TTGCACCTTC 
TTCTTTTCTG 
GCAGTTTTCA 
CATATAAACA 
GTGCCCATTC 
CAGTGGACTC 
AACAAAACCA 
GGGAGGGCGC 
GGTACTAAGT 
CGTTAGCTGA 
ACTGGCTTCT 
ATTAGGGAAT 
TGAAATTGGT 
CCTACCATCT 
TATGGTTACA 
TCCGTAATTA 
TTTACGATGT 
ATGGAACAGG 
GTAAGATTGT 
ACGGAAAGTG 
ACAAGGAAAA 
ATTCCTTTTT 
GTTTTAAATT 
ATATTTCCAG 
AAAATGACAA 
AAGAATTTTT 
TACCTAAAAA 
GGAAATAGCA 
G6TATACCAT 
AGATTGTGTA 
TTTCTATTTT 



ATACCTTTTT 
AGCAGTGTAA 
AAATTTATAT 



TGGGAAACCC 
CATCAATGGA 
AGGGGTACAG 
CTGCGTTGGT 
ATGCCTGTCC 
GAAATGTGGT 
TGAGTGCAAA 
CAC6GACCCC 
CG6CCAGGGT 
CACGGATGGG 
CGACCXrCATG 
GGA6GACACA 
TTACCACTGC 
CAAGGATGGA 
CAGGGACAAC 
GGGAGACCGC 
CAATGGGGAA 
GGACAACTGC 
AGATCAGTGT 
CCGCATTGGA 
TCTGGACAAC 
■ GGGAGATGCC 
CAGACTCGTG 
CAAAGATGAT 
TGACATCAGT 
ATCCCAAAAT 
CAACTGTGAT 
CACCTTCTTC 
GTCCAGCAGC 
CAACCCCACG 
AGGGCCTGGC 
GCGCACCCTG 
GCX5TCTCAGC 
AATCATGGCT 
GTTTGTCTTC 
C TAA TCATCA 
TGGAACTATG 
TGCTTGCATC 
AAAACAGACT 
ATTGCTTTGG 
CACTCTGCCT 
AAAAGCATTT 
CCCTGACATC 
ATACCCGAGA 
CATTTTCAGG 
TTAACCCATG 
GGACTTCCTC 
CA6AATCAAA 
GGCTTCATTC 
CAGTGAGCAC 
ATGGCACAAA 
CTAGGTAGTT 
AAAATATTTA 
AAGAAGCGTA 
AAATACAGAT 
TTTGAGAGCA 
TCAGTCTAAT 
TCTCTGTTTT 
GCAAAGAAAG 
GGAGAAGGAA 
AAGGTGAAAC 
GGATCAAGCG 
GTCAGTGTTG 
TTGAAATGTT 
TGCTTTATTT 
GATATGCTAT 
TATATAATGT 
TTTTTTTTTT 
CTAGGAATOT 
GTTGTATATT 
TT 



TGTGAAGGCG 
GGCTGGGGTC 
AAACGTAGTC 
GATGTAACAG 
AATCCCTGCT 
GCTTGTCCCC 
GAAGTGCCTG 
GGCTACAACT 
GTCGAACATG 
ACCCACXtf^CT 
TACCGCTGCG 
GACCTGGATG 
AAAAAGGATA 
ATTGGTGATG 
TGTCCATTCC 
TGTGACAACT 
GGAGACGCCT 
CAGTACGTCT 
GACAATTGCC 
GATACCTGTG 
TGTCCCTATG 
TGTGACCACG 
CCCAATCCCG 
TTTGACCATG 
GAGACCGATT 
GACCCTAACT 
CCTGGACTCG 
ATCAACACCG 
CGCTTTTATG 
AGGGCTCAGG 
GAGCACCTGC 
TGGCATGACC 
CACAG6CCAA 
6ACTCAGGAC 
TCTCAAGAAA 
AATTGTTGAT 
GGCTTGAGAA 
AGTGTGGACT 
CATCAGCATT 
TTTCCTTTTG 
TTGTCACAGA 
TCAGGCATGT 
CTCCTTCAGG 
CGATTGTATG 
GGATTGAAAG 
TAAATAGGCA 
CCTGATCCCC 
CCAGTGTAAG 
TA6ATGTAGC 
CAGCTGCCTC 
ATTATTATCA 
TTCTAATTCT 
TTTTTTACTT 
AAGACTATCC 
TATTTATTAA 
AGTAGTTGAC 
AAGCTGCTCT 
ATCTTTTCAA 
CCATGAGGTC 
AGCATATACA 
TTACATACAA 
GAAAGAGTTT 
TACATAGCAT 
AAATACAATT 
TTATAAATTA 
TTAAATAATT 
TTGCACACTG 
TTTTTTTTTG 
GCTTTTTTTT 
ACTGTTTCTT 



AAGCGCGGGA 
CTTGGTCACC 
GTCTCTGCAA 
AAAACCAGAT 
TTGCCGGCGT 
CTGGTTACAG 
ATGCCTGCTT 
GCCTGCCCTG 
CCACGGCCAA 
GCAACAAGAA 
AGTGCAAGCC 
GCTGGCCCAA 
ATTGCCCCAA 
CCTGTGATGA 
ATTACAACCC 
GTCCCTACAA 
GTGCTGCA6A 
ACAATGT6GA 
CCTT66AACA 
ACAACAATCA 
TGCCCAATGC 
ATGATGACAA 
ACCAGAAGGA 
ACAGTGTGCC 
TCCGCCGATT 
GGGTTGTACG 
CTGTAGGTTA 
AAAGGGACGA 
TTGTGATGTG 
GATACTCGGG 
GGAACGCCCT 
CTCGTCACAT 
AGACGG6TTT 
CCATCTATGA 



TGAAAGACTG 
AACCCCCAGG 
CCTAGAACGT 
CAGCCTCCAA 
AAAAAGCATC 
GCAGGGTGCT 
CAGAGAAGGG 
AACACGGGGA 
AAGAAAATAT 
ACTATTGCTG 
CTTAAATAGA 
ACCCTTACTC 
GCAGTGCTGG 
TTGTGCAGAT 
CCAAAGGAGG 
ACCTAACTAA 
CTCTTTTGGA 
ATTCTGGAAG 
ATGTCATCTT 
CTCTGTTCTG 
ATTTATCAGC 
GCCCCTTGTG 
GTGGAATTAG 
TTCAATACTG 
CTTTTTTCTT 
ATATTACCTC 
AAGTGTCTAA 
AAAAACTCTG 
TCTGAAAGTT 
TTTTCTCATT 
TATCAGGAAA 
AATTGAAGAA 
CTTTTGACCT 
GTACACATTT 
ATGTACAAGG 



GACCAAAGCC 
ATGGGACATC 
CAACCCCGCA 
CTGCAACAAG 
GAAGTGTACT 
TGGAAATGGC 
CAACCACAAT 
CCCCCCACGC 
CAAACAGGTG 
CGCCAAGTGC 
TGGCTACGCT 
TGAGAACCTG 
CCTTCCCAAC 
TGACGATGAC 
AGCTCAGTAT 
CXACAACCCA 
CATTGATGGA 
CCAGAGA6AC 
CAATCCGGAT 
GGATATTGAT 
CAACCAGGCT 
CGATGGCATT 
CTCTGACGGC 
AGACATCGAT 
CCAGATGATT 
CCATCAGGGT 
TGATGAGTTT 
TGACTATGCT 
GAAGCAAGTC 
CCTTTCTGTG 
GTGGCACACA 
AGGCTGGAAA 
CATTAGAGT6 
TAAAACCTTAT 
CTCTGACCTG 
ATCATAAACC 
ATCACTTCTC 
GC6ACCTGCC 
TGAATAAGAC 
TACTTGCTTC 
ATTGTGAGGC 
AGGACTCACT 
GCAGAGGCCA 
GGAGGAACTG 
GATTTCATGA 
AGCAGGAAAG 
ATCACCTTGC 
CTGCCATTGC 
6TAGCAG6AA 
6GCA6CCGTG 
AACATTCCTT 
AGTAT6ATTT 
ATCTGGCTGA 
TGTTGAGAGT 
CCTG6AAATT 
AAATCTCTTG 
CTCAGAGTGG 
TTGGTTATCC 
TTTTACCCCA 
TCATTTTTCC 
ATTTGTTGTG 
CAAACTTAAA 
CAGAGAAGTA 
ATGTTTTTTT 
GCCATTGGAA 
TACTGCCTGT 
TTGTTGGTTT 
CCCATTTTTA 
TTATCCATTT 
AACAACAATA 



1560 

1620 

1660 

1740 

1800 

1B60 

1920 

1980 

2040 

2100 

2160 

2220 

2260 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2680 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5260 

5340 

5400 

5460 

5520 

5560 

5640 

5700 



75 



Seq ID NO: 38 Protein aeouenee: 
Protein Accession #: NP 003237 
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10 



15 



20 



MGUVWGL6VL 
lEDANLIPPV 
SNGKAGTLDL 
VPIQSVFTRD 
TLDNNWNGS 
VTEENKELAN 
ATVPDGECCP 
RTCHIQECDK 
TKACKKDACP 
CNKQDCPIDG 
NHNGEHRCEN 
AKCNYLGHYS 
LPNSGQEDYD 
HNPDQADTDN 
NPDQXjDSDSD 
DGXPDDKDNC 
QMIPLDPKGT 
DYAGFVFGYQ 
WHTGNTPGQV 
KTYAGGRLGIi 



11 

I 

FLMHVCGTNR 
PDDKFQDLVD 
SIjTVQGKQHV 
LASIARLRIA 
SPAIRTNYIG 
ELRRPPLCYH 
RCWPSDSADD 
RFKQDGGWSH 
INGGWGPWSP 
CLSNPCFAGV 
TDPGYNCLPC 
DPMYRCECKP 
KDGIGDACDD 
NGEGDACAAD 
RIGDTCDNNQ 
RliVPNPDQKD 
SQNDPNWWR 
SSSRFYWMW 
RTIiWHDPRHI 
FVFSQBMVFF 



21 
I 

IPESGGDNSV 
AVRAEKGFLIi 
VSVEEALIiAT 
KGGVNDNFQG 
HKTKDLQAIC 
NGVQYRNNEE 
GWSPWSEWTS 
WSPWSSCSVT 
WDICSVTCGG 
KCTSYPDGSW 
PPRFTGSQPF 
GYAGNGIICG 
DDDNDKIPDD 
IDGDGIXjHER 
DID6DGHQNH 
SDGDGRGDAC 
HQGKEIiVCyrV 
KQVTQSYWDT 
GWKDFTAYRW 
SDLKYECRDP 



31 
I 

FDIFELTGAA 
LASLRQMKKT 
GQWKSITLFV 
VLQNVRFVFG 
GISCDEIiSSM 
WTVDSCTECH 
CSTSCGNGIQ 
CGDGVITRIR 
GVQKRSRLCN 
KCGACPPGYS 
GQGVEHATAN 
EDTDIiDGWPN 
RDNCPFHYNP 
DNCQYVYNVD 
LDNCPYVPNA 
KDDFDHDSVP 
NCDPGIAVGY 
NPTRAQGYSG 
RIiSHRPKTGF 



41 
I 

RKGSGRRLVK 
RGTLLALERK 
QEDRAQIiYID 
TTPEDIIiRNK 
VLELRGLRTI 
CQNSVTICKK 
QRGRSCDSLN 
liCNSPSPQMN 
NPAPQFGGKD 
GNGIQCTDVD 
KQVCKPRNPC 
ENLVCVANAT 
AQYDYDRDDV 
QRDTDMDGVG 
NQADHDKDGK 
DIDDICPENV 
DEFNAVDFSG 
IiSVKWNSTT 
XRWMYEGKK 



51 
t 

GPDPSSPAFR 
DHSGQVFSW 
CEKMENAEUD 
GCSSSTSVIiL 
VTTL.QDSIRK 
VSCPIMPCSN 
NRCEGSSVQT 
GKPCEGEARE 
CVGDVTENQI 
ECKEVPDACF 
TDGTHDCNKN 
YHCKKDNCPN 
GDRCDNCPYN 
DQCDNCPLEH 
GDACDHDDDH 
DISBTDFRRF 
TFPINTERDD 
GPGEHIiRHAIi 
IMADSGPZYD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Seq ID NO: 3 9 Nucleotide se craence ; 
Nucleic Acid Accession #: BC004299 

coding sequence: 6 9.. 1235 (underlined sequences correspond to start and stop codons) 



CCCGACCCGT 
GTGCGGCCAT 
CCCTGGACGC 
ACAAGGGCTC 
ACGAGAGGAA 
TGCTGGGAAA 
CGGAGCGGCT 
GGAAGAA6CA 
TCTCCCGGGA 
AGAAG6AGGA 
ACCAC6A6GG 
CGTACGGGCT 
CCTTCTTCTC 
CAGGGCACCC 
GCTCCCTGGC 
CCCCATCTCC 
CCCACCTGGG 
TGAGCCAGGT 
CTCCTGGCCA 
CCCAGGTGAC 
CGGCCACGTA 
CCCTCGCGCC 
TTTCCTCCCA 
GCTGGACTCT 
GCCCACATTT 
CTCCCAGTGG 
GACAGACTTG 
ATTAATAAAG 
TGATAATTTT 
TCCAAAGT6A 
GATTTGAGAA 
TATTTTATTT 
ACACACACTT 
TATAAAATTC 
GATGTTTAAA 



11 
I 

GCGAGGGCCA 
GGCTTCGCTG 
CGAGCTGTCG 
CGAGAGCCGT 
ACGGCTGGCA 
GTCGTGGAAG 
GCGCCTGCAG 
GGCCAAGCGG 
CCAGAAC6CC 
CAGGGGT6AG 
GCCGGCTGGT 
GCCCACACCT 
CTCCCCCTGC 
GTACTCACCG 
CCTTGGCCAG 
TGCCTATTAC 
CCAGCTTTCC 
GGAACTCCTG 
CCCAGACTCC 
ACC7U\CGGGT 
CTACAACAGC 
CTCTCCTTCT 
CCGCTCAGGG 
CCTTATCCGA 
TAAGTATATT 
AATGTTCACT 
ATAGCCAAGG 
6AAGATGG6G 
GTGTGCACAG 
CCACAAAATT 
ATTAACCAGT 
TAAATATACA 
CAAGAGCCAC 
AGTGTATTAG 
AACAAAACAG 



21 



31 



GGTCCGCGCC 
CTGGGAGCCT 
GATGGACAAT 
ATCCGGCGGC 
GT6CA6AACC 
GCGCTGAC6C 
CACATGCAGG 
CTGTGCAAGC 
CTGCCGGAGA 
TACTCCCCCG 
GGTGGCGGCG 
CCTGAAATGT 
CAGGAGGAGC 
GAGTACGCCC 
TCCCCCGGCG 
TCCCCGGCCA 
CCGCCTCCTG 
GGGGACATGG 
GCCACAGGGG 
CCCACAGAGA 
TACAGTGTGT 
TGTGCCTTGA 
CAGGGAGGTC 
GTGCCX3CCTC 
CCTTCAAGTG 
GACGTCTTTT 
TCCCTTCT6G 
AAATTTGACT 
CCCAAGGACC 
TCAAAGGGAC 
ATGGCTAACT 
TTTTAAAGCA 
CGCGCCCAGC 
TTTCATTACA 
GCTGTTGTAA 



TGCCCCGCCA 
ACCCTTGGCC 
CGCCGCCGGC 
CCATGAACGC 
CGGACCTGCA 
TGTCCCAGAA 
ACTACCCCAA 
GCGTGGACCC 
AGAGAAGCGG 
GCACTGCCCT 
GCGGCACCCC 
CTCCCCTGGA 
ATGGCCATCC 
CAAGCCCTCT 
TCTCCATGAT 
CCTACCACCC 
AGCACCCTGG 
ATCGCAATGA 
CCATGGCCCT 
CCAGCCTCAT 
C ATAGA GCTG 
GTGGCAGAGG 
TGAACT6CGG 
TATCCCCTTC 
AGTTTTCCTC 
CTTGGTAGCC 
TCCAGTTTTC 
CATTAATGAG 
ACGAGGCTTT 
TCATACAATT 
ATATCACAGA 
GTTCTTTTTT 
CTACATTTAT 
TAGGAGAAAT 
AAAAAAAAAA 



41 
I 

GGCGAAGCX3A 
CGAGGGTCTC 
CGTCCCCCGG 
CTTCATGGTT 
CAACGCCGAG 
GAGGCCGTAC 
CTACAAGTAC 
GGGCTTCCTT 
CAGCCGGGGG 
GCCCAGCCTC 
GAGCAGTGTG 
CGTGCTGGAG 
CCGCCGCATC 
CCACTGTAGC 
GTCCCCTGTA 
ACTCCACTCC 
CTTCGACGCC 
ATTCGACCAG 
CAGTGGGCAT 
CTCCGTCCTG 
6AGGCGCCCC 
AGCCGTCCA6 
CCCCAGAGCC 
CCCACGTTCC 
CAGCCCCTGA 
ATCATCGAAA 
TGATTTAGGG 
CTCGCTAACC 
CTGCACTTTC 
TGAGAAAAAA 
AAATGGGATT 
TTTGTTAATT 
AATTTTCATT 
TATATTTCTA 
AAAAAAAAA 



51 
I 

GGCXSACCCGC 
GAGTGCCCGG 
CCCCCGGGGG 
TGGGCCAAGG 
CTCAGCAAGA 
GTGGACGAG6 
CGGCCGCGCA 
CTGAGCTCCC 
GCGCTGGGGG 
CGGGGCTGCT 
GACACGTACC 
CCGGAGCAGA 
CCCCACCTGC 
CACCCCCTGG 
CCCGGCTGTC 
AACCTCCAAG 
CTGGATCAAC 
TATTTGAACA 
GTTCCGGTCT 
GCTGATGCCA 
GTCG6GTCAG 
CCACACCAGC 
TTTGGCCTAA 
AGCCCCTGCA 
GAGTTGCTGT 
CTAATGGGGG 
TTCTCTCAAG 
TACGATCTGG 
TGCACCCCCT 
CAGTCAACCT 
GAGTTAAAAC 
TGTTTATTAT 
CTCTTTTACC 
AACATTTTAT 



Seq ID NO: 40 Protein aecruencet. 
Protein Accession #: AAH04299 

•L 11 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



51 



21 31 41 I 

ilASLLGAYPW PBGIiECPAIiD AELSDGQSPP AVPRPPGDKG SBSRIRRPMM APMVWAKDER 
KRIiAVQNPDL HNAELSKMLG KSWKAIiTIiSQ KRPYVDEABR UUiQHMQDYP NYKYRPRRKK 
QAKRIiCKRVD PGPLLSSLSR DQNAIiPBKRS GSRGAIiGEKB DRGBYSPGTA LPSLRGCYHE 



60 
120 
180 



199 
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GPAGGGGGGT PSSVDTYPYG LPTPPBMSPL DVLEPEQTFF SSPCQEEHGH PRRIPHLPGH 240 

PYSPEYAPSP IiHCSHPIiGSL ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNIiQAHL 300 

GQLSPPPEHP GFDALiDQIiSQ VEI*IiGDMDRN EPDQYUJTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVIiADATAT YYNSYSVS 



Seq ID NO: 41 Nucleotide secmence: 
Nucleic Acid Accession #r NM_004449 

Coding sequence: 1..1389 (underlined sequences correspond to start and stop codons) 



ATSATTCAGA CTGTCCCX^GA CCCAGCAGCT CATATCAAGG AAGCCTTATC AGTTGTGAGT 60 

GAGGACCAGT CGTTGTTTGA GTGTGCCTAC GGAACGCCAC ACCTGGCTAA GACAGAGATG 120 

ACCGCGTCCT CCTCCAGCGA CTATGGACAG ACTTCCAAGA TGAGCCCACG CGTCCCTCAG 180 

CAGGATTGGC TGTCTCAACC CCCAGCCAGG GTCACCATCA AAATGGAATG TAACCCTAGC 240 

CAGGTGAATG GCTCAAGGAA CTCTCCTGAT GAATGCAGTC TGGCCAAAGG CGGGAAGATG 300 

GTGGGCAGCC CAGACACCGT TGGGATGAAC TACX3GCAGCT ACATGGAGGA 6AAGCACATG 360 

CCACCCCCAA ACATGACCAC GAACGAGCGC AGAGTTATCG TGCCAGCAGA TCCTACGCTA 420 

TGGAGTACAG ACCATGTGCG GCAGTGGCTG GAGTGGGCGG TGAAAGAATA TGGCCTTCCA 480 

GACGTCAACA TCTTGTTATT CCAGAACATC GATGGGAAGG AACTGTGCAA GATGACCAAG 540 

GACGACTTCC AGAGGCTCAC CCCCAGCTAC AACGCCGACA TCCTTCTCTC ACATCTCCAC 600 

TACCTCAGAG AGACTCCTCT TCCACATTTG ACTTCAGATG ATGTTGATAA AGCCTTACAA 660 

AACTCTCCAC GGTTAATGCA TGCTAGAAAC ACAGATTTAC CATATGAGCC CCCCAGGAGA 720 

TCAGCCTGGA CCGGTCACGG CCACCCCACG CCCCAGTCGA AAGCTGCTCA ACCATCTCCT 780 

TCCACAGTGC CCAAAACTGA AGACCAGCGT CCTCAGTTAG ATCCTTATCA GATTCTTGGA 84 0 

CCAACAAGTA GCCGCCTTGC AAATCCAGGC AGTGGCCAGA TCCAGCTTTG GCAGTTCCTC 900 

CTGGAGCTCC TGTCGGACAG CTCCAACTCC AGCTGCATCA CCTGGGAAGG CACCAACGGG 960 

GAGTTCAAQA TGACGGATCC CGACGAGGTG GCCCX3GCGCT GGGGAGAGCX3 GAAGAGCAAA 1020 

CCCAACATGA ACTACGATAA GCTCAGCCGC GCCCTCCGTT ACTACTATGA CAAGAACATC 1080 

ATGACCAAGG TCCATGGGAA GCGCTACGCC TACAAGTTCG ACTTCCACGG GATCGCCCAG 1140 

GCCCTCCAQC CCCACCCCCC GGAGTCATCT CTGTACAAGT ACCCCTCAGA CCTCCCGTAC 1200 

ATGGGCTCCT ATCACGCCCA CCCACAGAAG ATGAACTTTG TGGCGCCCCA CCCTCCAGCC 1260 

CTCCCCGTGA CATCTTCCAG TTTTTTTGCT GCCCCAAACC CATACTGGAA TTCACCAACT 1320 

GGGGGTATAT ACCCCAACAC TAGGCTCCCC ACCA6CCATA TGCCTTCTCA TCTG6GCACT 1380 
TACTA CTAA 



Seq ID NO: 42 Protein sequence i 
Protein Accession #: NP 004440 



1 11 21 31 41 51 

I I I I I I 

MIQTVPDPAA HIKEAIiSWS EDQSLFECAY GTPHIiAKTEM TASSSSDYGQ TSKMSPRVPQ 60 

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMEEKHM 120 

PPPNMTTNER RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI DGKEIiCKMTK 180 

DDFQRLTPSY NADILLSHLH YLRBTPLPHL TSDDVDKALQ N5PRLMHARN TDLPYEPPRR 240 

SAWTGHGHPT PQSKAAQPSP STVPKTEDQR PQLDPYQILG PTSSRIiANPG SGQIQLWQFIj 300 

LELLSDSSNS SCITWEGTNG BFKMTDPDBV ARRWGERKSK PNMMYDKLSR AIiRYYYDKNI 360 

MTKVHGKRYA YKFDFHGIAQ ALQPHPPESS LYKYPSDLPY MGSYHAHPQK MNFVAPHPPA 420 
LPVTSSSFFA APNPYWNSPT GOIYPNTRLP TSHMPSHLGT YY 



Seq ID NO: 43 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005100 

Coding sequence: 192.. 553 7 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CCTTCTTTTA AGGAGTTTGC CGCGAGCGCG TCTCCTTCAT TCGCAGGCTG GGCGCGTTCG 60 

CAGTCGGCTG GCGGCGAAGG AAGGCGCTCT CGGGACCTCA CGGGCGCGCG TCTTTTGGCT 120 

CTTGCCCCTG TCCCTGCGGC TTGGGGAAAG CGTAACCCGG CGGCTAGGCG CGGGAGAAGT 180 

GCGGAGGAGC qWSQGCGCC GGGA6CTCCA CCGAGCAGCG CAGCCCGGAG CAGCCGCCCG 240 

AGGGGAGCTC CACGCCGGCT GAGCCCGAGC CCAGCGGCGG CGGCCCCTCG GCCGAGGCGG 300 

CGCCAGACAC CACCGCGGAC CCCGCCATCG CTGCCTCGGA CCCCGCCACC AAGCTCCTAC 360 

AGAAGAATGG TCAGCTGTCC ACCATCAATG GCGTAGCTGA GCAAGAT6AG CTCAGCCTCC 420 

A6GAGGGT6A CCTAAATGGC CAGAAAGGAG CCCT6AACGG TCAAGGAGCC CTAAACAGCC 480 

AG6AGGAAGA AGAAGTCATT GTCACGGAG6 XTGGACAGAG AGACTCTGAA GATGTGAGCG 540 

AAAGAGACTC CGATAAAGAG ATGGCTACTA AGTCAGCGGT T6TTCACGAC ATCACAGATG 600 

ATGGGCAGGA GGAGAACCGA AATATCGAAC AGATTCCTTC TTCAGAAAGC AATTTAGAAG 660 

AGCTAACACA ACCCACTGAG TCCCAGGCTA ATGATATT6G ATTTAAGAAG GT6TTTAAOT 720 

TTGTTGGCTT TAAATTCACT GTGAAAAAGG ATAAGACAGA GAAGCCTGAC ACTGTCCAGC 780 



200 
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TACTCACTGT GAAGAAAGAT GAAGGGGAGG 
CCAGCCTTGG GGCTGGAGAA GCAGCATCCA 
AACCCGAAGA GACCCTGAAG CGTGAGCAAA 
CTGGCCAAGC AGTGGAGGAA TGCAAAGAGG 
5 GCAAGTCTGC AGAATCTCCG ACTAGTCCCG 
AATTCTTCAC TCAAGGTTGG GCCGGCTGGC 
AGGATGAAGT GGAAGCTTCA GAGAAGAAAA 
AAGAAGACGG AAAGGCAGAG GTTGCCTCCG 
CACAGGAGCC GGCAGAAAGT GCCCACGAGC 
10 AGCTGCCCTC AGAGGAGCAA GTCAGTGGCT 
CGTTGGCGAC AGAAGTGTTT GATGAGAAAA 
AAGTCCACGT CAGCACCGTG GAGGAGAGAA 
CAGCAGGGTC TGTGCCAGCT GAAGAATTGG 
AACCTGCCAA GGAGCTGGTG AAGCTCAAAG 
15 AGGGAGCTGA CCTCAGTCCT GATGAGAAGG 
GTGAGGTGGA AATGCTGTCA TCACAGGAGA 
AGCTTTTTAC CAGCACTGGC TTAAAAAAGC 
GAGGAGGAGA CGAGGAATCT^ GGGGAGCACA 
AGGAGGAGCA AAAGGGCGAG AGCTCTGCCT 
20 GTCTGGAAAA GGGCTTAGCC GAGGTGCA6C 
CCGATGGAGA GAAAAAAAGA GAA6GTGTCA 
CGCCCAAGAA GCGTGTTAGA CGGCCTTCGG 
TCAAGAGCGC TACCTTGTCT TCCACCGAGA 
AAGGGAGCGT GGAAGAGCCA AA6CCGGAAG 
25 CTTGG6AAGC TTTAATTTGT GTGGGATCAT 
CTGATGAGGA AGGGGGACCA AAAGCAATGG 
GAAAAGACAA AGAGACGGGG ACAGACGGGA 
GGCAGGGAAG TTCCTCCCCG GAGCAAGCTG 
CCTGGGAGTC ATTTAAAAGG TTAGTCACGC 
30 AGAAAAGCGA AGACTCCATA GCTGGGTCTG 
CCGGTAAAGA AGAATCCTGG GTCTCT^TCA 
GGCCAGATGG GAAACAAGAA CAAGCCCCTG 
AAGATGACTC TGATGTCCCG GCCGTGGTCC 
AGAAAATGGA GGCACAGCAA GCCCAAAAAG 
35 CTGAGGTGTC CAAGGAGCTC AGCGAGAGTC 
ACGGGACGAG GGCAGCTACC ATTATTGAAG 
TGACAGAACC TCTTGAACAA GTAGAAGCTG 
AAAGAGAAGT AATTGCAGAA GAAGAACCCC 
GAGAGGCCCG GGGCGACACG GTCGTTAGTG 
40 CTGCAGAAAC TGCAGGGCCA TTGGGTTCCG 
AGACCACAGA AATGGTGTCA GCAGTCTCCC 
AGGCCACTCC GGTGCAGGAG 6TGGAAGGTG 
GGACTCAAGA GGTCCTCCAG GCAGTGGCAG 
GCACCGGTGG GCCAGAAGAT GTGCTTCAGC 
45 AAGAGCAGGC TGAAGCGTCG GGTCTGAAGA 
CTCAGGAGGC AAAAACTGAG CCTTTTACAC 
AAAGCTTTGA AAAAGCTCCT CAAGTCACAG 
CTTGTCAAGC CGAAACCTTA GCTGGGGTAA 
TCCCCCCTGA CTCGGTGGAA ACCCCTACAG 
50 CCGACTTTGA CGCACCAGGC ACAACCCAGA 
ATGAGGTCGC ATCTGGTACC CAGTCAGGGG 
AAGAGAGGCC TCCAGCACCT TCCAGTTTTG 
AGATGGAAGA CACTCTAGAG CATACAGATA 
TGTCAAAGAC TGAGGGGACT CAA6AG6CTG 
55 TACCATTTTT CGAAGGACTT GA6GGGTCTA 
AGGTCACT6A AGTTGCCCTT AAAGGTGAAG 
ATGCTCTTGA ACTGCAGAGT CACGCTAAGT 
TAGTTCAAGT CGAAAGGGA6 AAAACAGAAG 
TTGAGCACGA AACAGCTGTT ACCGTATCTG 
60 TGAATOTGCC CATCATAGAT GGGGCAAAGG 
CCTGCCTAGG TCAAGAGGAG GCAGTATGCA 
CATTCACTCT AACAGCGGCT GCAGAGGAGG 
TAGAAACAGG TGAAACGTTG GAGCCTGCAG 
CTGAAAAAAA TGAAGACTTT GCCGCTCATC 
65 ACTGTCAGGC AAAATCGACA CCAGTGATAG 
CCGACCTGGA AGGAGAGAAA ACCACATCAC 
AGGTTGCTTG CCAGGAGGTC AAAGTGAGTG 
GGATTTTGGA ACTTGAGACC AAAAGCAGTA 
TTGACCAGTT TGTACGTACA GAAGAAACAG 
70 CACAAGCTCA CGTGATAAAA GCTGACAGCC 
GAGAGGAACC TCAGGCCTCT GCACAGGATG 
CAGAGTCAAC CGCAGTGGGA CAAGCACATT 
CAGAAAAGAC CATGACTGTT GAGGTAGAAG 
AGGTCGTCCT CCCATCTGAG GAAGAGGGAG 
75 ATGATGGTCA TGCCTTGTTA GCAGAAAGAA 
AT6AAAAAGG TGATGATGTT GAT6ACCCTG 



GAGCA6CAGG GGCTGGCGAC CACCAGGACC 84 0 

AAGAAAGCGA ACCCAAACAA TCTACAGAGA 900 

GCCACGCAGA AATTTCTCCC CCAGCCGAAT 960 

AA66AGAAGA GAAACAAGAA AAAGAACCTA 1020 

TGACCAGTGA AACAGGATCA ACCTTCAAAA 1080 

GCAAAAAGAC CAGTTTCAGG AAGCCGAAGG 114 0 

AGGAACAAGA GCCAGAAAAA GTAGACACAG 1200 

AGAAACTGAC CGCCTCCGAG CAAGCCCACC 1260 

CCCGGTTATC AGCTGAATAT GAGAAAGTTG 1320 

CGCAGGGACC TTCTGAAGAG AAACCTGCTC 1380 

TAGAAGTCCA CCAAGAAGAG GTTGTGGCCG 144 0 

CCGAAGAGCA GAAAACGGAG GTGGAAGAAA 1500 

TTGGAATGGA TGCAGAACCT CAGGAAGCCG 1560 

AAACGTGTGT TTCCGGAGAG GACCCTACAC 1620 

TGCTGTCCAA ACCCCCCGAA GGCGTTGTGA 1680 

GAATGAAGGT GCAGGGAAGT CCACTAAAGA 1740 

TTTCTGGAAA GAAACAGAAA GGGAAAAGAG 1800 

CTCAGGTTCC AGCCGATTCT CCGGACAGCC 1860 

CATCCCCTGA GGAGCCCGAG GAGATCACGT 1920 

AGGATGGGGA AGCTGAAGAA GGAGCTACTT 1980 

CTCCCTGGGC ATCATTCAAA AAGATGGTGA 2040 

AAAGTGATAA AGAAGATGAG CTGGACAAG6 2100 

GCACAGCCTC T6AAATGCAA GAAGAAATGA 2160 

AACCAAAGCG CAAGGTGGAT ACCTCAGTAT 2220 

CCAAGAAAAG AGCAAGGAGA AGGTCCTCTT 2280 

GAGGAGACCA CCAGAAAGCT GATGAGGCCG 234 0 

TCCTTGCTGG TTCCCAAGAA CATGATCCAG 2400 

GAAGCCCTAC CGAAGGGGAG GGCGTTTCCA 2460 

C/^GAAAAAA ATCAAAGTCC AAGCTGGAAG 2520 

GTGTAGAACA TTCCACTCCA GACACTGAAC 2580 

AGAAGTTTAT TCCTGGACGA AGGAAGAAAA 2 640 

TTGAAGACGC AGGGCCAACA GGGGCCAACG 2700 

CTCTGTCTGA GTATGATGCT GTAGAAA6GG 2760 

GCGCAGAGCA GCCCGAGCAG AAGGCAGCCA 2820 

AGGTTCATAT GATGGCAGCA GCTGTCGCTG 2880 

AAAGGTCTCC TTCTTGGATA TCTGCTTCAG 2940 

AAGCCGCACT GTTAACTGAG GAGGTATTGG 3000 

CCACGGTTAC TGAACCTCTG CCAGAGAACA 3060 

AGGCGGAATT GACCCCCGAA GCTGTGACAG 3120 

AAGAAGGAAC CGAAGCATCT GCTGCTGAAG 3180 

AGTTAACCGA CTCCCCAGAC ACCACAGAGG 3240 

GCGTACCTGA CATAGAAGAG CAAGAGAGGC 3300 

AAAAAGTGAA AGAGGAATCC CAGCTGCCTG 3360 

CTGTGCAGAG AGCAGAGGCA GAAAGACCAG 3420 

AAGAGACGGA TGTAGTGTTG AAAGTAGATG 3480 

AAGGGAAGGT GGTGGGGCAG ACCACCCCAG 3540 

AGAGCATAGA GTCCAGTGAG CTTGTAACCA 3600 

AATCACAGGA GATGGTGATG GAACAGGCTA 3660 

ACAGTGAGAC TGATGGAAGC ACCCCCGTAG 3720 

AAGACGAGAT TGTGGAAATC CATGAGGAGA 3780 

GCACAGAAGC A6AGGCAGTT CCTGCACAGA 3840 

TGTTCCAGGA AGAAACTAAA GAACAATCAA 3900 

AAGAGGTGTC AGTGGAAACT GTATCCATTC 3960 

ACCAGTATGC TGATGAGAAA ACCAAAGACG 4020 

TAGACACAGG CATAACAGTC AGTCGGGAAA 4080 

GGACAGAAGA AGCTGAATGT AAAAAGGATG 4140 

CTCCTCCATC CCCC6TGGAG AGAGAGATGG 4200 

CA6AGCCAAC CCATGTGAAT GAAGAGAAGC 4260 

AAGAGGTCAG TAAGCAGCTC CTCCAGACAG 4320 

AAGTCAGCAG TTTGGAAGGA AGCCCTCCTC 4380 

CCAAAATTCA AGTTCAGAGC TCTGAGGCAT 4440 

AAAAGGTCTT AGGAGAAACT GCCAACATTT 4500 

GTGCACATTT AGTTCTGGAA GAGAAATCCT 4560 

CAGGGGAAGA TGCTGTGCCC ACAGGGCCCG 4620 

TATCTGCTAC TACCAAGAAA GGCTTAAGTT 4680 

TGAAGTGGAA GTCAGATGAA GTCGATGAGC 4740 

TAGCAATTGA GGATTTAGAG CCTGAAAATG 4800 

AACTTGTCCA AAACATCATC CAGACAGCCG 4860 

CCACCGAAAT GTTGACX3TCT GAGTTACAGA 4920 

AGGACGCTGG ACAG6AAACG GAGAAAGAA6 4980 

AAACACCAAT TACTTCAGCC AAAGAGQAGT 5040 

CTGATATTTC CSkAAGACATG AGTGAAGCCT 5100 

GTTCCACTGT AAATGATCAG CAGCTGGAAG 5160 

GTGGAGCTG6 AACAAAGTCT GTGCCAGAAG 5220 

TAGAGAAGTC ACTAGTTGAA CCX3AAAGAA6 5280 

AAAACCAGAA CTCAGCCCTG GCTGATACTG 5340 



201 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



ATGCCTCAGG 
AGGAGGATGC 
CGATCACACC 
AACTTACAGA 
TGTGAAGACA 
AGAACTTTGA 
CAATCCTGAG 
CTACAATTTT 
GAACTGGAGT 
TTATATGTAT 
AGGAAATGTG 
CATAGTGCAG 
CCTAGATGTA 
CATATTTAGT 
AAAAGCATTT 
CAGTGGAGGT 
ACTTGTTATG 
GTTTTATATG 
GACTGCATTC 
TACTGGATTG 
TCATTTGGAT 
AAAAAAAA 



AGGCTTAACC 
CCAGGAAGTA 
CCAAGCACAG 
ATC TTAA AAC 
AGTAGTAGAA 
GAATTGGAGA 
GCTTCATCAG 
CCCTTGATAA 
TGGCAATACC 
GTTTTAAGTA 
CAGGATACTA 
GCTTGGGGAG 
ACATTCCTGA 
GGTACTCTGA 
TGAAACATAC 
TA6AAAGAAG 
6TTGTTTGGA 
TTATGAAGAA 
ACACATGGCA 
TGTTTGTGCC 
AAGTTGTGAT 



AAAGAGTCCC 
6AATTGCAGG 
GAGGAGTTAC 
ATCATGCAGT 
GAAAATGAAT 
GCAGGCACAT 
GAGCTAGAGC 
CCATATAAAT 
TAGTTCTGCT 
GTCCTCCTGT 
CATGCTTTTT 
CTTTAAGCCT 
TCAAGGTACA 
AATTGGTCAC 
AGAATGTTCT 
TTATATTCTG 
CCGATAAGTG 
AAQAATTGTT 
TGAAATAAGT 
ATATTTGT6C 
TTGACGACTG 



CAGATACAAA 
AAGGAAAAGT 
AGAAACAA6A 
TAAACTCATT 
GCTGCTGCTG 
CAACTGATCT 
CATTTAACAT 
TCTGATTTAA 
TCTGAAACTG 
ATCTATTGTA 
GTATCACACA 
CAGTTATATA 
ATTCTTTAAA 
TTTCCTATTA 
ATTGTCATTG 
GTAGCAAATT 
TGCTTAATCC 
GTAAGTTTTT 
CAGGTTCTTT 
CATTCCTTTA 
ATTTAAATAA 



TGGACCAAAA 
GCACA6TGAA 
GAGAGAATCT 
GTCTGTTTGG 
AGACTGAAGA 
CATTTCTAGA 
TTCCTCTTTC 
GGTCCTAAAT 
GAGTATCATT 
TATTTTTTTC 
GTATATGATG 
ACCCACAAAA 
ATTCACTAAT 
CACGGAGTGT 
GGAAATTTTG 
AACTTTACAT 
TGAGGCAAAG 
GATTCTACTC 
ACAAATGGTA 
AGAACAATGT 
AATATTTGCT 



CAAAAAGAGA 
TCAGATAAAG 
GCAAAGTCAG 
AAGACCAGAA 
CCAGTATTTC 
GAGCCCCTGA 
CAAGACCAAC 
TCTTAACCTG 
CTTTACATAT 
TTAATGTTTA 
GGGCATGTGC 
AACAGAGCCT 
GATTGAGGTC 
GCCAAAACTA 
CTTTCTAACC 
CCTTTTTCCT 
TAGTGAATAT 
TTATATGCTG 
TTTTGATAGA 
TGCAACACAT 
TCACTTAAAA 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 



25 



Seq ID NO: 44 Protein aecnience : 
Protein Accession #: MP 005091 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



I 

MGAGSSTEQR 

QLSTINGVAE 

DKEriATKSAV 

KFTVKKDKTE 

TLKREQSHAK 

QGWAGWRICKT 

AKSAHEPRLS 

STVEERTEEQ 

LSPDEKVLSK 

EESGEHTQVP 

KKRE6VTPWA 

EEPKPEEPKR 

ETGTDGIIiAG 

DSIAGSGVEH 

DVPAWPLSE 

AATIIBERSP 

GDTWSEAEL 

VQEVEGGVPD 

EASGLKKETD 

ETLAGVKSQE 

SGTQSGGTEA 

EGTQBADQYA 

LQSHAKSPPS 

IIDGAKEVSS 

ETLEPAGAHIi 

GEKTTSLKWK 

VRTEETATBM 

AVGQAHSDIS 

ALIiAERZEKS 

QEVELQEGKV 



11 

t 

SPBQPPEGSS 

QDELSLQEGD 

VHDITDDGQE 

KPDTVQLLTV 

ISPPAESGQA 

SFRKPKEDEV 

ABYEKVEXiPS 

KTEVEETAGS 

PPEGWSEVE 

ADSPDSQBEQ 

SPKKMVTPKK 

KVDTSVSWEA 

SQEHDPGQGS 

STPDTEPGKE 

YDAVEREKME 

SWISASVTEP 

TPEAVTAAET 

lEEQERRTQE 

WLKVDAQEA 

MVMEQAIPPD 

EAVPAQKERP 

DEKTKDVPFF 

PVEREMWQV 

LEGSPPPCLG 

VXjEEKSSBKN 

SDEVDEQVAC 

LTSBXiQTQAH 

KDMSEASEKT 

LVEPKEDBKG 

HSESDKAITP 



21 
I 

TPAEPEPSGG 

liNGQKGAIiNG 

KNRNIEQIPS 

KKDEGEGAAG 

VEECKEEGEE 

EASEKKKEQE 

EEQVSGSQGP 

VPAEBIjVGMD 

MLSSQERMKV 

KGBSSASSPE 

RVRRPSESDK 

LXCVGSSKKR 

SSPEQA6SPT 

ESWVSIKKFI 

AQQAQKGAEQ 

LEQVEAEAAL 

AGPLGSEEGT 

VliQAVAEKVK 

KTEPFTQGKV 

SVETPTDSET 

PAPSSPVFQE 

EGIiEGSIDTG 

EREKTEAEPT 

QEEAVCTKIQ 

EDFAAHPGBD 

QEVKVSVAIE 

VIKAOSQDAG 

MTVEVEGSTV 

DDVDDPENQN 

QAQEELQKQE 



31 

I 

GPSAEAAPDT 

QGAXiKSQBEE 

SESNLEBLTQ 

AGDHQDPSIiG 

KQEKEPSKSA 

PBKVDTEBDG 

SEEKPAPLAT 

AEPQEAEPAK 

QGSPLKKLFT 

EPEEXTCIiBK 

EDEIiDKVKSA 

ARRRSSSDEB 

EGE6VSTWES 

PGRRiOCRPDG 

PEQKAATEVS 

LTEBVLEREV 

EASAAEETTE 

EESQLPGTGG 

VGQTTPESFE 

DGSTPVADFD 

ETKBQSKMED 

ITVSREKVTE 

HVNEEKLEHE 

VQSSEASFTIi 

AVPTGPDCQA 

DIiEPENGILE 

QETEKEGEEP 

NDQQLEEWL 

SAIiADTDASG 

RESAKSBLTE 



41 
I 

TADPAIAASD 
EVIVTEVGQR 
PTESQANDIG 
AGEAASKESE 
BSPTSPVTSE 
KAEVASEKLT 
EVFDEKIEVH 
ELVKIiKETCV 
STGLKKLSGK 
GliAEVQQDGB 
TLSSTESTAS 
G6PKAN66DH 
PKRLVTPRKK 
KQEQAPVEDA 
KELSESQVHM 
lAEEBPPTVT 
MVSAVSQLTD 
PEDVLQPVQR 
KAPQVTESIE 
APGTTQKDEI 
TLEHTDKEVS 
VALKGEGTEE 
TAVTVSEEVS 
TAAAEEEKVL 
KSTPVIVSAT 
LETKSSKLVQ 
QASAQDETPI 
PSEEEGGGAG 
GZiTKESPOTN 
S 



51 

I 

PATKLIiQKNG 

DSEDVSERDS 

FKKVFKFVGF 

PKQSTEKPEE 

TGSTFKKFFT 

ASEQAHPQEP 

QEEWAEVHV 

SGEDPTQG7UD 

KQKGKRGGGD 

AEEGATSDGE 

EMQEEMKGSV 

QKADBA6KDK 

SKSKLBEKSB 

6PTGANBDDS 

MAAAVAOGTR 

EPLPENREAR 

SPDTTEBATP 

ABAERPEEQA 

SSELVTTCQA 

VEIHEENEVA 

VETVSILSKT 

AECKKDDAiE 

KQLLQTVNVP 

GETANILETG 

TKKGIiSSDLE 

NIIQTAVDQF 

TSAKEESEST 

TKSVPEDDGH 

GPKQKBKBDA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 45 Nucleotide aemieng^. 
Nucleic Acid Accession NM_001290 

Coding sequence: 110 .. 1231 (underlined sequences correspond to start and stop codons) 



1 

I 

GTGAGCGTGT 
TGGACX3TTAC 
ACCACATGAC 
CATGGTACAG 
GGATAGTGAC 
ATTAACCCTT 
CATCCCCCGT 
CAAACACTCG 
CATGGTCACC 



11 
I 

GTGCGTGCGT 
TGATACTCTG 
CCCTTCTATT 
CCAGAGTACC 
AACCTCTGGT 
TCATTTTGTT 
TACTTTAGCA 
AAAGAGTCAT 
CAGCACGGGA 



21 
I 

CTACTTTGTA 
TTTAGCTTGA 
CTTCTCCTTT 
GAATCTATGA 
GGGACGCCTT 
TGGAAGATGG 
CTGTGTTTGA 
ACCACAACTC 
A6CCCATGTT 



31 

I 

CTGGGAAGAA 
TTTTCGAAAA 
CGGCCCATTT 
GATGAACAAG 
TGCCACTGAA 
ACCAAAGCGA 
AGGAGGGGTG 
ATCCATCACQ 
TACCAAGGTA 



41 

I 

CACAGCCCAT 
GCAGGCAAGA 
TATAGGAGGC 
AGACTGCAGT 
TTTTTTGAAG 
TACACTATCG 
ACCGACCTGT 
GTGGACTGCG 
TGTACAGAAG 



51 
I 

GTGCTCTGCA 
TGTCCAGCAC 
ATACACCATA 
CTCGCACAGA 
AT6ACGCCAC 
GCAGGACCCT 
ATTACATTCT 
ACCAGTGTAC 
GCAGACTGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



202 



NSOOCID: <WO 02079492A2^I„> 



wo 02/079492 



PCT/US02/04915 



CTTGGAGTTC ACCTTTGATG ATCTCATGAG AATCAAAACA TGGCACTTTA CCATTAGACA 600 

ATACCGAGAG TTAGTCCCGA GAAGCATCCT AGCCATGCAT GCACAAGATC CTCAGGTCCT 660 

GGATCAGCTG TCCAAAAACA TCACCAGGAT GGGGCTAACA AACTTCACCC TCAACTACCT 720 

CAGGTTGTGT GTAATATTGG AGCCAATGCA GGAACTGATG TCGAGACATA AAACTTACAA 780 

CCTCAGTCCC CGAGACTGCC TGAAGACCTG CTTGTTTCAG AAGTGGCAGA GGATGGTGGC 840 

TCCGCCAGCA GAACCCAC/IA GGCAACCAAC AACCAAACGG AGAAAAAGGA AAAATTCCAC 900 

CAGCAGCACT TCCAACAGCA GCGCTGGGAA CAATGCAAAC AGCACTGGCA GCAAGT^GAA 960 

GACCACAGCT GCAAACCTGA GTCTGTCCAG TCAGGTACCT GATGTGATGG TGGTAGGAGA 1020 

GCCAACTCTG ATGGGAGGTG AGTTTGGGGA CGAGGACGAA AGGCTAATCA CTAGATTAGA 1080 

AAACACGCAA TATGATGCGG CCAACGGCAT GGACGACGAG GAGGACTTCA ACAATTCACC 1140 

CGCGCTGGGG AACAACAGCC CGTGGAACAG TAAACCTCCC GCCACTCAAG AGACCAAATC 1200 

AGAAAACCCC CCACCCCAGG CTTCCCAATA AGATGATCGG CACCAGAATC CACTGTCAAT 1260 

AGGCCCGTGG GTGATCATTA CAATTGCAAA TCTTTACTTA CAGGAGAGGA AACAGAAGAG 132 0 

ATAAAAACTT TTCCATGCAA ATATCTATTT CTAT^VCCACA ATGATCTGAT TTTCTTTCTT 13 8 0 

CTTTCTTTTT TTCTAATTGA GAGGATTATT CCCAGTAAGC TTCCATGACC CTTTCTTGGA 1440 

GGCCTTCACA GGTAATACAG ATACTGGCAC TGATTGTAAT TAAAATGAGA GAAAACTCTA 1500 

GCGCATCTTC TGGCACGGTT TTAACAACGT GTTTGTGTTG AATTTCCTTT TTATGCATCA 1560 

AACGAAGGCC ATATTGTCCA TAAATGCTCA GTGCTCAGGA TCTCATTAAT ATGCCGAACC 1620 

TAACTACAGA TGACTTTTTA ATATTGTAAA ATATTTTCTG CTTTTT6ACT TGCATCT6AG 1680 

AGTTTCTTGT TTCAGTAAAA i^AAGAAAAGA CAAAAAAATC AGCTTTGGAA AGTAATTTAA 1740 

ATGTACCTTA TTTTTTTTTT CTTTATGTTT TCTTTCATTG GGCAACAGCT AAGAGG6CCC 1800 

AGCAAGGTAA TTTATGGTTG AGCTGATGTC AATTGGTTCT TGTCTTGA6T CGACTCA ATT 1860 

TA6CCCAAGT GCTGAAACAA GAAATGTCAT TTTTTTCATC AAAGACACCA GGGCAGATTT 1920 

TTAAGTAAAG AAAGACAATT GGACCCTTAA GAATTTATGC ATTTGTAAAG TTGCTGTTGA 1980 

TCCAAATATT TTCAAGCCAT GTAATCCATT GGTTTTGTGG GCAGTTTAAT AAACCTGAAC 2040 

CTTTGTGTGT TTTCTAATTG TACXrTGAGTT GACCATCCTT TCTTTTTATA GTATATTTCT 2100 

TGTATGATAT TTTGTAAAGC TCTCACCTGG TTCTTTTATG GGGACTTTTC GTTTTTGGGC 2160 

AACTCCAGTG TATTTATGTG AAACTTTATA AGAGAATTAA TTTTTCCATT TGCATATTAA 2220 

TATGTTCCTC CACACATGTA AAGGCACAGT GGCTCCGTGT GTTAAAAAAC AGCTGTATTT 2280 
TATGTATGCT TTACTGATAA GTGTGCCAAT AATAAACTGT GTTAATGACC 



Seq ID NO: 4 6 Protein sequence; 
Protein Accession #: NP__001281 



1 11 21 31 41 51 

I I I I I I 

MSSTPHDPFY SSPFGPFYRR HTPYMVQPEY RIYEMNKRLQ SRTEDSDNLW WDAFATBFPE 60 
DDATLTLSPC liEDGPKRYTI GRTLIPRYFS TVFEGGVTDL YYILKHSKES YHNSSITVDC 120 
DQCTMVTQHG KPMPTKVCTB GRIiILEFTFD DIiMRIKTWHP TIRQYRBLVP RSIIAMHAQD 180 
PQVIiDQLSKN ITRMGIiTNFT LNYLRIiCVIL EPMQEI.MSRH KTVNLSPRDC LiCTCLFQKWQ 240 
RMVAPPAEPT RQPTTKRRKR KNSTSSTSNS SAGNNANSTG SKKKTTAANI* SLSSQVPDVM 300 
WGEPTIjMGG EFGDEDERLI TRLENTQVDA ANGMDDBEDF NNSPALGHNS PWNSKPPATQ 360 
ETKSENPPPQ ASQ 

Seq ID NO: 47 Nucleotide sequence; 
Nucleic Acid Accession #: NM_004126 

Coding sequence: 108.. 329 {underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I 1 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCG6TTCTGG GGCGAAAATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTA TATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAQ AACCCCTTTA 300 

AA6AAAAAG6 CAGCT6TGTT ATTTC ATAAA TAACTTQGGA GAAACTTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 48 Protein aeouence: 
Protein Accession #: NP_004117 

1 11 21 31 41 51 

I I I 1 I I 

MPAliHIEDLP BKEKIiKMEVE QLRKEVKLQR QQVSKCSfiEI KNYIBBRSGB DPIiVKGIPED 60 
KNPFKEKGSC VIS 



Seq ID NO; 49 Nucleotide sequence; 
Nucleic Acid Accession #: XM_051896 

Coding sequence: 139. .2388 (underlined sequences correspond to start and stop codons) 
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20 



25 



30 



35 



40 



45 



50 



GTTTTAAAGA 
GAAACCAAAA 
CACAAGTTTA 
ATGCTTGATA 
AAGAGAACAA 
ATTTTGGATC 
ATGGATGAAA 
AAAGAAGTTC 
GTTTGCTCAT 
A6ACAACAGA 
AGTGAAGGAT 
GGTTTCCGAG 
CTGGATTGTG 
TATTCTCACC 
AATGTTAGCC 
TCTTTATGGA 
TTAATAGGAG 
AAAGTTAATA 
GTTTCAGAGC 
AAATATGGTA 
GTTAAGAAGT 
TCCATATTGT 
GAGGAAGAAT 
GATGATGAAT 
AGTGATAATC 
TTATTCAATA 
CTCAATACAT 
GATGATGAAC 
CTGGAT6TCA 
TATCCCTTGA 
GCAAGGCCAA 
AAAATGAACA 
AAGGAGTGCT 
ATCCACTTTG 
GAAACTGAGG 
TTTTCAACCT 
CACTTCAATA 
TATAGAAGAC 
TTTTTCAACA 
CAGTTTCTGA 
AGATAGTCGT 
ATGAGAATAA 
TTAGCTACAT 
TTTTAAACAT 
AAAATATTTA 
ACTAGATTTA 
TACATGAAAT 



11 
I 

CGCTAGAGTG 
TGTCATTTAT 
CGGTAGTGGT 
CTCCAGATCC 
GACATTTCAA 
CTAATCAGGA 
CTCTAGGGAC 
CTTTTATTTT 
GCCCAGACCT 
GAAAAGAACA 
TGCATTCTGC 
CCATG6TGGG 
CTACCTACGT 
CTGATTTTCC 
ACAATCCCCT 
AGAAGAAAAG 
AAACACTAAT 
CTGCACAATG 
TGATGTTTGC 
CTTTTATGGC 
ATGAAGAAAA 
TCAACAGAGT 
TAGAAAATAT 
CACACGAACC 
AAGCAAGTTG 
CCAGAGAAGG 
CTTATCCACT 
TGGATGCAGC 
AAAGTAAAAA 
TACTGAGACC 
GTGACTCTAG 
AGCTCCCCTT 
ATGTCTTTAA 
TTCTGGCCAA 
AAGAGAAAGA 
TCAATTTTCA 
CTCTGAACAA 
AGAATCCATC 
AGGAGTTTCT 
TGCTGAGGCA 
ACTGATCATG 
TACTATTATA 
TTTCAGTCAG 
TTCTCACCAA 
ACAGTTCAAT 
TTCATACCAT 
AAATACATCA 



CCAAAGAAGA 

AGATCCTTAC 

GTTACGTGCC 

CTATGTGGAA 

TAATGACATA 

AAATGTTTTG 

AGCAACATTT 

CAACCAAGTC 

ACGATTTAGT 

CATAAGGGAG 

ACGTGATGTG 

ATTCTCTGGT 

TGCTGGTCTT 

AGAGAAAGGG 

TTTACTTCTC 

CTCTGGACAA 

TCATAATAGA 

CCCTTTACCT 

AGATTGGGTT 

TCCCGACTTA 

CCCCTTGCAT 

TTTGGGCGTT 

TACCACAAAG 

CAAAGGCACT 

GATTCATCGT 

ACGTGCTGGG 

GTCTCCTTTG 

TGTAGCAGAT 

GATTCATGTA 

TCAGAGAGG6 

TCCTCCGTTC 

TCCAAAGATT 

ACCCAAGAAT 

CATCAACTTC 

AATCGCTGAC 

ATATCCAAAT 

CATTGATGTG 

TCGTTGCTCT 

AAGTAAACCC 

GTTTGCAATC 

AGAGACTGGC 

AGTTAGGTTG 

TATGAACTyC 

CTTTCTTATG 

CTCAATAAGA 

GAGACAACAC 

ATATAAAAAT 



31 

1 

CTTTGAAGTG 

CAGCACATTA 

ACCAAAGTGA 

CTTTTTATCT 

AACCCTGTGT 

GAGATTACGT 

ACTGTATCTT 

ACTGAAATGG 

ATGGCTCTGT 

AGCAT6AAGA 

CCTGTGGTAG 

GTGATGAAGG 

TCTGGCTCCA 

CCAGAGGAGA 

ACACCACAGA 

CCTGTCACCT 

ATGAATACTA 

CTTTTCACCT 

GAATTTAGTC 

TTTGGAAGCA 

TTCTTAATGG 

TCTGGTTCAC 

CATATTGTGA 

GAAAATGAAG 

ATGATAATGG 

AAGGTACACA 

AGTGACTTTG 

CCTGATGAAT 

GTGGACAGTG 

GTTGATCTCA 

AAGGAACTTC 

GATCCTTATG 

CCTGATATGG 

A6AAAGTACA 

TTTGATATTT 

CAAGCATTCA 

ATAAAAGAAG 

GTTTCCCTTA 

AAAGCATAGT 

CCATGACAAC 

TGATACTCAA 

ACAAATGATG 

CTGATACAAA 

TGTGTTCTTT 

CCTCGCATTA 

TATTTTTATT 



41 
I 

TGAAAACATT 
TAGTGGAGCA 
CAAAGGGGGC 
CTACAACCCC 
GGAATGAGAC 
TAATGGATGC 
CTATGAAGGT 
TTCTAGAAAT 
GTGATCAGGA 
AACTCTTGGG 
CCATATTGGG 
CATTATAC6A 
CCTGGTATAT 
TTAATGAAGA 
AAGTTAAAAG 
TTACTGATAT 
CTCTGAGCAG 
GTCTTCATGT 
CATACGAAAT 
AATTTTTTAT 
GTGTCTGGGG 
AAAGCAGAGG 
GTAATGATAG 
ATGCTGGAAG 
CCTTGGTGAG 
ACTTCATGCT 
CCACACAGGA 
TTGAGCGAAT 
GGCTCACATT 
TAATCTCCTT 
TACTTGCAGA 
TGTTTGATCG 
AGAAAGATTG 
6GGCTCCAGG 
TTGATGACCC 
AAAGACTACA 
CCATGGTTGA 
GTAATGTTGA 
TCATGTACTG 
TGGATTTAAA 
AGTTGCAGTT 
TTGATTATGT 
TGTAGGGATA 
TTAAAAATTT 
TGTATGAATG 
TATATATGCA 



51 
I 

TCCTGTAATT 

CCAGTATTCC 

CTTTGGTGAC 

TGACAGCAGG 

CTTTGAATTT 

CAATTATGTC 

GGGAGAAAAG 

GTCTCTTGAA 

GAAGACTTTC 

TCCAAAGAAT 

TTCAGGTGGG 

ATCAGGAATT 

GTCAACCTTG 

ACTAAT6AAA 

ATATGTTGAG 

CTTTGGGATG 

TTTGAAGGAA 

CAAACCTGAC 

TGGCATGGCT 

GGGAACAGTC 

CAGTGCCTTT 

CTGCACAATG 

CTCGGACAGT 

TGACTATCAA 

TGATTCAGCT 

GGGCTTGAAT 

CTCCTTTGAT 

ATATGAGCCT 

TAACCTGCCG 

TGACTTTTCT 

AAAGTGGGCT 

GGAAGGGCTG 

CCCAACCATC 

TGTTCCAAGG 

AGAATCACCA 

TGATCTTATG 

AAGCATTGAA 

GGCAAGAAGA 

GAAATGGCAG 

AGTACAGTAC 

ACTTAGCTGC 

AAGGATATAC 

TATACTGTAT 

TTTTTCTTTT 

TTATTCACTG 

TATATATACA 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



55 



60 



65 



70 



Seq ID NO: 50 Protein secruence: 
Protein Accession #: XP 051896 



1 
I 

MSFIDPYQHI 
RHFNNDINPV 
PFIFNQVTEM 
LHSARDVPW 
PDFPEKGPEE 
ETLIHNRMNT 
TFMAPDLFGS 
LENITTKHIV 
TREGRAGKVH 
KSKKIHWDS 
KLPFPKIDPY 
EEKEZADFDI 
QHPSRCSVSI) 



11 
I 

IVEHQYSHKF 
WNETPEFILD 
VLEMSLEVCS 
AILGSGGGFR 
INEELMKNVS 
TLSSLKEKVN 
KFFMGTWKK 
SNDSSDSDDE 
NFMIiGLNLNT 
GLTFNLPYPL 
VFDREGIiKEC 
FDDPBSPFST 
SNVEARRFFN 



21 
I 

TVWIiRATKV 
PNQENVLEIT 
CPDIiRFSMAI, 
AMVGPSGVMK 
HNPLLLLTPQ 
TAQCPLPLFT 
YEENPIiHFLM 
SHEPKGTENE 
SYPLSPLSDF 
ILRPQRGVDL 
YVFKPKNPDM 
FNFQYPNQAF 
KBFLSKPKA 



31 
I 

TKGAFGDMIjD 
liMDANYVMDE 
CDQEKTFRQQ 
ALYESGILDC 
KVKRYVESLW 
CIiHVKPDVSE 
GVWGSAFSIL 
DAGSDYQSDN 
ATQDSFDDDE 
IISFDFSARP 
EKDCPTIIHF 
KRXjHDIiMHFN 



41 

•I 

TPDPYVEliFI 
TLGTATPTVS 
RKBHIRESMK 
ATYVAGLSGS 
KKKSSGQPVT 
loMFADWVEFS 
FNRVLGVSGS 
QASWIHRMIM 
LDAAVADPDE 
SDSSPPFKEIi 
VliANINFRKY 
TIiNNZDVIKB 



51 
I 

STTPDSRKRT 
SMKVGEKKEV 
KLLGPKNSEG 
TWYMSTLYSH 
FTDIFGMIjIG 
PYEIGMAKYG 
QSRGSTMEEE 
AliVSDSALFN 
FERIYEPLDV 
LLAEKWAKMN 
KAPGVPRETE 
AMVESIEYRR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 51 Nucleotide aecnience; 
Nucleic Acid Accession #: NM_006S28 
75 Coding sequence: 57.. 764 (tinderlined sequences correspond to start and stop codons) 
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JNSDOCID: <WO 02079492A2J_> 
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10 



15 



20 



GCCGCCAGCG 
ACCCCGCTCG 
GCGATGCTGC 
ACGGACCCTG 
GCCAGTTCCT 
GCGACGATGC 
TGGACGACCA 
GTGAAAAATT 
AAGCTACTTG 
AAGATGAGGG 
CCTGTGATGC 
AGGATTGCAA 
GCTTTGCCAG 
ATCTTGTTTG 
GCATGAGGAA 
TTCAAAAATT 
TTTAATTTAT 
AAATATGACT 
AAACAACATA 
CC 



11 
I 

GCTTTCTCGG 
CCCCCTGGGG 
TCAGGAGCCA 
CCGGGCCCTA 
GTACGGGGGC 
TTGCTGGAGG 
GTGTGAGGGG 
CTTTTCCGGT 
TATGGGCTTC 
ACTGTGCTCT 
TTTCACCTAT 
ACGTGCATGT 
TAGAATCCGG 
TCTTTATGGC 
ACAAATCATT 
TGGATTTTTT 
GGTTCAACTG 
CACTCATTTC 
AGACAATATA 



21 
1 

ACGCCTTGCC 
CTGTCGATTC 
ACAG6AAATA 
CTTCTCCX5TT 
TGCGAGGGCA 
ATAGAAAAAG 
TCCACAGAAA 
GGGTGTCACC 
TGCGCACCAA 
GCCAATGTGA 
ACTGGCTGTG 
GCAAAAGCTT 
AAAATTCGGA 
TTATTTGCCT 
GGTGATTTAT 
TATATATAAC 
TTTGTGAGAC 
TTGGGGTCGT 
ATCATGTGCT 



31 
1 

CAGC6GGCCG 
TGCTGCTTTT 
ACGCGGAGAT 
ACTACTACGA 
ACGCCAACAA 
TTCCCAAAGT 
AGTATTTCTT 
GGAACCGGAT 
AGAAAATTCC 
CTCGCTATTA 
GAGGGAATGA 
TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACATAT 



41 

1 

CCCGACCCCC 
CCTGACGGAG 
CTGTCTCCTG 
CAGGTACACG 
TTTCTACACC 
TTGCCGGCTG 
TAATCTAAGT 
TGAGAACAGG 
ATCATTTTGC 
TTTTAATCCA 
CAATAACTTT 
AAAGAAGATG 
TTAA ACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



51 
I 

TGCACCATGG 
GCTGCACTGG 
CCCCTAGACT 
CAGAGCTGCC 
TGGGAGGCTT 
CAAGTGAGTG 
TCCATGACAT 
TTTCCAGATG 
TACAGTCCAA 
AGATACAGAA 
GTTAGCAGGG 
CCAAAGCTTC 
TTAATATGTC 
TAATATGACA 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 52 Protein sequence t 
25 Protein Accession #: NP_006519 

1 11 21 31 41 51 

I I I I I I 

MDPARPLGIiS ILLLPLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 

30 CRQFIiYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNIiSSM 

TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 

RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 



60 
120 
180 



35 Seq ID NO: 53 Nucleotide sequence t 
Nucleic Acid Accession #: AA478778 
Coding sequence: no ORE fotind 



40 



45 



50 



55 



60 



TATTTTTGTA 
TGATCCTTCA 
ACTTTTTTAA 
GTATAAAAGA 
TCACAGATTG 
GGAAAAAAAT 
GAAATATAAA 
CACTTGGATG 
AGACTTAGAC 
GTGCCTTGGT 
TATTGGGAAA 
TTACCAGTAG 
CTGGAGAAAT 
GCTGGTGATG 
CTGCAGTAAT 
TTCCGATT6C 
CATGGTCTTT 
ACACTTGGTT 
CAGTATTCTC 
TGAATAA6TG 
CACCACACCC 



11 

1 

CGTAAAATGA 
TTATCACGGT 
AATGAATTTT 
TATTTTTGGC 
TACCAACTAT 
ATGCTGCCTT 
CACTTTTAAT 
AAATAAGACC 
TTTATCCTTA 
CTCTCCACAA 
GTGAGATCCT 
AAAGACACAG 
TCAGAACCAG 
TGACTTCTCT 
GGACGTTTGT 
TCATTAATTC 
CTGCCCCTCC 
TGAGAAACCC 
CAACTCCAAA 
TTATTCTCCA 
AAAAAAAAAA 



21 
I 

TTCTATTATG 
ACACTATTGT 
TTTAAAACAA 
ATTTCTAGGC 
TAACTATGTT 
GGTGCTAATA 
GAAAGGGAGG 
AGCTCTTTAC 
TTGTTGTTAG 
TCAAATGGAG 
CTCACCATTT 
GATGCACAGA 
GTTCTGAATC 
TCAGGCCATG 
GTGAAGAAAT 
ACTTTTTTGT 
AAGCTGATGA 
TGCCCACTTC 
CAAGCTCTAG 
TTATTAATGT 
AAAAAAAAAA 



31 
I 

ACTGCCTTTG 
TTACTTTTCA 
TCTAGCCATC 
AAGTATCAGC 
AAATAAGTAT 
TTGTATGTAT 
AACGGAAGGA 
CCTTATTTTT 
TGTTGTTAAT 
GATCCCCCAA 
TGCCAAGATA 
ATGGGCATGA 
ATCACGATTG 
AGCCTAACAY 
GAACTGTGGA 
TACTTCTTTC 
AGG6AAGCCT 
CAAAGACCAA 
AGTGCTCCAG 
GTTCTGAAAA 
AAAA 



41 
I 

CATGTAGTAA 
TCTGTAAATG 
ATCAAGGTGC 
CAATAAGTAT 
TCAGTTTCAT 
TTAAATGATC 
CAATTTCCAG 
GGATATGCCT 
ATTCGTTGCT 
GCAGCTTCAT 
CTCTAAAAT6 
CCTTCAGCTC 
CCTTTTGCAT 
CCTGCCGGTT 
GTACAAAATG 
CAAAATGGAA 
TTGCCAATGG 
AGAGATTAGG 
GAAAAGTTAT 
TATATTATQA 



51 

I 

TATGACAAAG 
TTTTATTGTT 
TATAAGAGTT 
GTTAGTGATA 
GTGATCTCTG 
ATCTGACTCA 
TGCACAGAAT 
TTTTTGGAAG 
TCAGCCCACG 
TACAGAGTGA 
ACATCCAAGT 
ACGAGCACAC 
GAAAACATCG 
TTCATGCCCG 
CTTTGA6TCT 
GTGCTGAAGC 
CCCATGGAAG 
AAAAGCCrrGG 
ATTCAGTATA 
ATAAATACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



65 Seq ID NO: 54 Nucleotide secruence; 

Nucleic Acid Accession #: NM_020663 

Coding sequence: 1..645 (underlined sequences correspond to start and stop codons) 



70 



75 



ATGAACTGCA 
ATGTTGAAGT 
TACGCCAACG 
ACTGTGACTG 
GACTACAACC 



11 
1 

AAGAGGGAAC 
GTGTGGTGGT 
ACGCCTTGCC 
TGGGAGGCAA 
AGCTGAGGCC 



21 
I 

TGACAGCAGC 
GGGGGACGGT 
AGAGGAATAC 
GCAACACTTG 
ACTCTCCTAC 



31 
I 

TGCGGCTGCA 
GCCGTGGGGA 
GTGCCCACTG 
CTCGGACTGT 
CCCAACACGG 



41 
I 

GGGGCAACGA 
AAACCT6CCT 
TGTTTGACCA 
ATGACACC6C 
ATGTGTTTTT 



51 

1 

CGAGAAGAAG 60 

GCTGATGAGC 120 

CTATGCAGTT 180 

GGGACAGGAG 240 

GATCTGCTTC 300 



205 



wo «2/079492 
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TCTGTCGTAA ACCCTGCCTC TTACCACAAT GTCCAGGAGG AATGGGTCCC CGAGCTCAAG 360 

GACTGCATGC CTCACGTGCC TTATGTCCTC ATAGGGACCC AGATTGATCT CCX3TGATGAC 420 

CCAAAAACCT TGGCCCGTTT GCTGTATATG AAAGAGAAAC CTCTCACTTA CGAGCATGGT 4 80 

GTGAAGCTCG CAAAAGCGAT CGGAGCACAG TGCTACTTGG AATGTTCAGC TCTGACTCAG 540 

AAAGGTCTCA AAGCGGTTTT TGATGAAGCA ATCCTCACCA TTTTCCACCC CAAGAAAAAG 600 
AAGAAACGCT GTTCTGAGGG TCACAGCT6C TGTTCAATTA T CTGA 



Seq ID NO: 55 Protein secmencQi 
10 Protein Accession #: NP_065714 

1 11 21 31 41 51 

I I I I i I 

MNCKEGTDSS CGCRGNDEKK MIiKCWVGDG AVGKTCLLMS YANDAFPBEY VPTVFDHYAV 60 
15 TVTVGGKQHL LGIiYDTAGQB DYNQLRPLSY PNTDVFLICF SWNPASYHN VQEEWVPELK 120 
DCMPHVPYVL IGTQIDIiRDD PKTLARLLYM KEKPLTYEHG VKIAKAIGAQ CYLECSALTQ 180 
KGLKAVFDEA ILTIPHPKKK KKRCSEGHSC CSII 

20 Seq ID NO: 56 Nucleotide oecruence: 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence: 1-546 (underlined sequences correspond to start and stop codons) 

25 1 11 21 31 41 51 

I I I I I i 

aSGGCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCTCCCCGAG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC CGCAGCCCTG 180 

30 CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTGTG CCTTTGAAGA GAGCACTTGC 24 0 

GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 3 00 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAGTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

J!) AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCACCATGGA 540 
AAGTAG 



40 



45 



seq ID NO: 57 Protein seouenee.- 

Protein Accession #: fgenesh prediction 



11 



21 



31 41 51 

I I I I I I 

MAIiGSSAPVA LQGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRIiLGGAR TETRFVPAAL SO 

QIiAGALDLPA GSCAPEESTC GFDSVIASLP WIUNBEGQQP FWSSGDMSDW DVWVGWRKIiI 120 

HSPLSTPGWS RQVRLQUQL QFVKGQNLDV TVYCRLQQSB KPPETQSHVP FTFMyWIHHa X80 
K 



50 



55 



60 



65 



70 



75 



Seq ID NO: 58 Nucleotide sequence: 
Nucleic Acid Accession #: XM_050478 

Coding sequence: 27.. 4 508 (underlined sequences correspond to start 



1 11 

I I 
CCGGCGGCGC CTGAGCCCAG 
CTGTGCAGCT GCAAGGGGGG 
GTGAGCCGCT CACAGTGTCT 
TGAGGACTGG TGATGAGCTG 
AGGCCCTCAT TCTCATCAAA 
ACGCCCCTGT CAGTAGGCCG 
AAGCAGCCAC CACCATGCAT 



51 



21 31 41 

I I I I 

CCGAG GATGG AGAACCGGCC TGGGTCCTTC CAGTAC6TCC 
GCACCCTGGG GCTTCACCCT TAAGGGGGGT CTGGAACACT 
AAGATTGAAG ATGGAGGCAA GGCA6CTTTG TCCCAGAAGA 
GTGAATATCA ATGGCACTCC ATTATATGGC TCCC6CCAAG 
GGCTCCTTCC GGATTCTCAA GCTGATTGTC AGGA6GA6GA 
CACTCATGGC ATGTGGCCAA GCTGCTGGAG GGAT6CCCTG 
TTCCCTTCTG AAGCCTTCAG CTTGTCCTGG CATTCTGGCT 
GCAACACAAG TGACGTGTGT GTGCAGTGGT GTCCACTCTC CCGGCATTGC AGCACCGAGA 
AAAGCAGCTC CATTGGCAGC ATGGAGAGCC TGGAGCAACC AGGCCAAGCC ACCTATGAGA 
GCCATCTGTT GCCTATTGAC CAGAACATGT ACCCTAACCA GCGTGACTCA GCCTACAGCT 
CCTTCTCGGC CAGCTCAAAT GCTTCTGACT GTGCCCTTTC CCTCAGGCCA GAGGAGCCAG 
CCTCTACAGA CTGCATCATG CAAGGCCCAG GGCCAACTAA GGCCCCCAGT GGCCGGCCTA 
ATGTGGCTGA GACCTCAGGA GGTAGTCGGC GCACCAATGG GGGCCACCTG ACCCCCAGCT 
CTCAGATGTC ATCCCGTCCA CAGGAGGGAT ACCAGTCAGG GCCCGCCAAA GCAGTCAGGG 
GCCCACCACA ACCTCCAGTG AGGCGGGACA GCCTTCAGGC CTCCAGAGCC CAACTCCTCA 
ATGGAGAGCA GCGCAGGGCA TCTGAGCCTG TGGTCCCCTT GCCACAGAAG GAGAAACTGA 
GCTTAGAGCC TGTGCTACCC GCAAGGAACC CTAATAGGTT CTGTTGCCTC AGTGGGCATG 
ACCAAGTGAC AAGTGAGGGC CATCAGAACT GTGAGTTCAG TCAGCCTCCT GAATCCAGCC 
AACAGGGCTC TGAGCATCTA CTGATGCAGG CCTCAACCAA AGCT6TTGQA TCCCCAAAAG 
CCTGTGACAG AGCTTCCAGC GT6GATTCCA ACCCACTCAA TGAGGCTTCT 6CAGAGCTAG 
CTAAGGCTTC TTTTGGCAGA CCTCCACATC TCATAGGACC CACAGGGCAT CGCCATAGTG 
CCCCTGAACA GCTGCTGGCA TCCCACCTGC AGCATGTGCA CCTTGATACC AG666CAGCA 
AAGGGATGGA GCTCCCACCC GTACAGGATG GGCACCAGTG GACTCTGTCC CCTTTGCACA 



and stop codons) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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GCAGCCACAA AGGGAAGAAA AGTCCATGCC CCCCTACAGG AGGAACCCAT GACCAGTCCA 1440 

GCAAAGAAAG AAAGACCAGA CAAGTGGATG ACAGGTCTTT AGTTTTGGGA CACCAGAGCC 1500 

AAAGCAGTCC CCCACATGGA GAGGCTGATG GACACCCCTC AGAAAAAGGT TTCCTGGACC 1560 

CAAACAGAAC AAGCAGAGCA GCCAGTGAAT TGGCCAACCA GCAACCCTCT GCCTCTGGCT 1620 

5 CCCTTGTTCA ACAAGCCACG GACTGTTCTT CAACCACTAA AGCAGCTAGT GGCACAGAGG 1680 

CAGGTGAAGA AGGGGACAGC GAGCCCAAGG AGTGCAGCCG GATGGGTGGT AGGCGAAGTG 1740 

GAGGGACCCG GGGCCGCTCG ATCCAAAACC GGCGGAAGAG TGAGCGTTTT GCTACCAATC 1800 

TGCGTAATGA AATTCAGAGG AGGAAGGCCC AGCTCCAGT^ AAGCAAGGGT CCCTTGTCAC 1860 

AGCTGTGTGA CACTAAGGAG CCAGTGGAAG AGACCCAGGA GCCCCCAGAA AGTCCTCCAC 1920 

10 TCACTGCCTC TAACACATCT CTTCTATCTT CATGTAAAAA ACCTCCCAGC CCCAGAGACA 1980 

AGCTCTTCAA CAAAAGCATG ATGCTCAGAG CTAGGTCTTC CGAGTGCCTC AGCCAAGCCC 2040 

CTGAGAGCCA TGAATCTAGG ACAGGCTTAG AGGGACGAAT AAGCCCTGGC CAGAGGCCTG 2100 

GCCAGTCCTC TTTGGGCCTG AACACCTGGT GGAAAGCACC TGACCCATCC TCCTCAGACC 2160 

CTGAGAAAGC ACATGCTCAC TGTGGAGTCC GTGGAGGTCA TTGGAGATGG TCTCCAGAGC 2220 

15 ATAATTCACA GCCACTTGTG GCAGCAGCCA TGGAAGGCCC TTCCAACCCA GGTGACAACA 2280 

AGGAATTGAA GGCTTCTACT GCTCAAGCTG GGGAGGATGC CATCCTCTTG CCTTTTGCAG 2340 

ACAGAAGAAA GTTCTTTGAA GAGAGTAGCA AATCCTTATC TACATCTCAT TTGCCAGGTT 2400 

TAACCACTCA TAGCAACAAG ACTTTTACCC AGAGACCAAA ACCTATAGAC CAAAACTTCC 2460 

AGCCAATGAG CTCCAGCTGT AGGGAATTGA G6CGCCATCC CATGGACCAA TCATATCATT 2520 

20 CCGCAGACCA ACCATATCAT GCCACAGACC AATCATATCA TTCCATGTCA CCCCTTCAGT 2580 

CAGAAACTCC CACTTACTCA GAATGTTTTG CAAGCAAAGG TCTAGAAAAT TCCATGTGTT 2640 

GTAAGCCACT ACACTGTGGT GATTTTGATT ACCACAGGAC CTGCTCTTAC TCCTGCAGTG 2700 

TTCAAGGAGC TCTAGTCCAT GATCCTTGCA TTTATTGTTC TGGQGAAATC TGCCCTGCCT 2760 

TGCTAAAGAG AAATATGATG CCAAATTGCT ACAACTGCCG GTGCCACCAC CACCAATGCA 2820 

25 TTCGGTGTTC AGTTTGCTAT CATAATCCTC AGCACA6TGC CCTCGAGGAC AGCAGCTTGG 2880 

CACCTGGCAA CACTTGGAAA CCCAGGAAGC TGACAGTGCA GGAATTTCCT GGGGACAAAT 2 940 

GGAATCCAAT AACAGGAAAC AGGAAGACCA GCCAGTCAGG GAGGGAAATG GCTCATTCCA 3000 

AGACTAGCTT TTCATGGGCA ACCCCTTTCC ATCCTTGCCT TGAGAACCCA GCACTGGACT 3060 

TGTCAAGCTA CCGAGCAATT TCTTCTCTTG ACCTCCTTGG AGACTTCAAA CATGCTTTGA 3120 

30 AAAAATCAGA GGATU^CTTCA GTTTATGAGG AGGGGAGCTC CCTTGCCTCC ATGCCCCACC 3180 

CACTGCGCAG CCGTGCCTTC TCAGAGAGTC ACATCAGCTT GGCGCCCCAA AGCACCCGGG 3240 

CCTGGGGGCA GCATAGGAGG GAGCTCTTTA GCAAAGGTGA TGAGACCCAG TCGGATCTTC 3300 

TCGGAGCCAG GAAGAAGGCC TTTCCTCCTC CTCGCCCTCC TCCTCCCAAC TGGGAGAAGT 3360 

ACAGGCTCTT TCGTGCAGCC CAGCAGCAGA AGCAGCAACA GCAGCAGCAG AAGCAACAGG 3420 

35 AGGAGGAGGA GGAGGAGGAA GAAGAAGAAG AAGAGGAAGA GGAAGAGGAG GAGGAGGAGG 3480 

CAGAGGAGGA GGAAGAGGAG CTGCCACCCC AGTATTTCAG TTCAGAAACC TCTGGTTCCT 3540 

GTGCTCTCAA TCCTGAGGAG GTCCTAGAGC A6CCACAACC CCTCAGCTTT GGCCACCTGG 3600 

AGGGCTCGAG ACAGGGTTCA CAAAGTGTCC CAGCAGAGCA AGAATCCTTT GCACTCCATT 3660 

CCAGTGATTT CTTGCCTCCA AT7AGGGGTC ACTTGGGATC TCAACCTQAG CAGGCTCAGC 3720 

40 CCCCTTGCTA CTATGGCATT GGTGGGCTTT GGAG6ACATC GGGACAGGAA GCCAGTGAAT 3780 

CCGCCAAACA AGAGTTTCAG CACTTTTC6C CTCCTTCAGG GGCCCCAGGA ATCCCTACCT 3840 

CTTACTCAGC TTATTACAAT ATTTCTGTGG CCAAGGCAGA GCTGCTGAAC AAACTGAAAQ 3900 

ACCAACCTGA GATGGCAGAG ATTGGCCTA6 GAGAGGAGGA AGTTGACCAT GAACTGGCTC 3960 

AAAAAAAGAT ACAGCTTATC GAAAGCATCA GCAGAAAACT TTCTGTCTTG CGGGAGGCCC 4020 

45 AGCGAGGGCT GCTAGAGGAC ATCAATGCCA ATTCTGCCCT TGGGGAGGAG GTGGAGGCCA 4080 

ACTTAAAAGC CGTCTGCAAA TCCAATGAAT TTGAAAAGTA CCACTTGTTT GTTGGGGACC 4140 

TGGACAAAGT GGTCAACCTG TTGCTGTCAC TCTCTGGACG ACTGGCCCGG GTGGAGAATG 4200 

CTCTGAACAG CATCGATTCA GAGGCCAACC AGGAGAAGTT GGTACTGATA GAGAAGAAGC 4260 

AGCAGCTGAC GGGGCAGTTG GCAGATGCCA AGGAGCTGAA GGAGCACGTG GACCGCCGGG 4320 

50 AGAAGTTGGT GTTTGGCATG GTCTCCCGCT ACCTGCCTCA GGACCAGCTC CAAGATTACC 4380 

AGCACTTTGT CAAGATGAAA TCTGCTCTCA TCATTGAACA GCX3AGAGCTG GAGGAGAAGA 4440 

TCAAGCTCGG GGAAGAGCAA CTCAAATGTC TCAGGGAGAG TCTACTCCTG GGGCCCAGCA 4500 

ATTT CTAAT T CTACCAGCAC TCTGCCACAG CATCCCTGCC CAGCCATGTG GGAAGTGCTT 4560 

TCAATCTTCT TTGTTAGCAG TTTCTCAGCA AGTAGATAGC AATTAGCAGT TTGTTCCA6C 4620 

55 CCTCTACCCT GGATGTCTCT CACTACCCCT TCCCTAGCAG TGGTCCTAAC CAGCTAGGAG 4680 

ACCCTGGGGA AGCCACAA6C TTCTACCCAA GGGAGCTGCA GCAAGGT6T6 ATCTTAGAAC 4740 

CACACTCTCC TTCCCACAGT TGCCAAGGGC AAGTACTTGC TGCACAGAGA ACCAAG6AAG 4800 

TGCCTTCATT CTGCTTTGTA CTAGGACACC AAAGACATCA AGTACTCATC ACCCACCCAT 4860 

ATCATCAACA GCCTCTAAAG GCTCAGAGGG AATCTGCCTT GCAGCTCTAC TCTGCCCCAG 4920 

60 GGCTTGTGGC CAGCCATTTC TCACAGAGAG CTGGCTGCCT TGAG6GCATT CACCTGGCAC 4980 

CAGTTTCAGG GCCTCACCCA AGCTTTGCAG GGGAAAGCAC A6AG6GA6GA ATTACACTGA 5040 

AAAAAATGCA AGCAAAGGTT GAGTACCCCC AGGTGCCCCT TAGGAAGGAA CCAGGTTTAA 5100 

ATAGGCTCTA CCCTTACCTT TCCCAGCAGC AAGTTCAGGG GAAGAGGCCT ACTCTTAGCC 5160 

CTGGCTAGTG TGACCCTCTT CCTGTCCTAA GACTTTGGTC CTACCACCTC TTGTTTCATC 5220 

65 TTTCCTTTAC ATTGCTGGGG GTTACCGCAG GTGCCTACCC CAGGGCTTCA CCATATGGGC 5280 

CATTAATAGC TCTACTAAAA CTGACTTCTA GATGTAGGTT TCATTATTGG GGGAGGGGGT 534 0 

TCTTATTGTT ATATTTTAAA TGGCCTTTTG ATTTTATTTA TTTTTATGTT TTGATTATTT 5400 

TTTTCTTTTT TAACTAATAA GGCGAGAAGA GGGAAGTTGG AGAGGGAAAA GTTAGCCCAG 5460 

AAGGAAAGCA TTTTCTGCAG ATCAGCCTGA ATCCACCGTG GCTAGGCATA TTCTTGCTCT 5520 

70 TCTCGTGTTG CTCACAACTA CCTGCCTGGA TGAATTTAGG AAAGTTGCAG GATACAAGGT 5580 

TAAAACACAA GATCAAATGA ACAATCCGAA AATGTTATTA AGAAAACAGT TCCGGCCGGG 5640 

CATGGTGGCT CACGCCTGAA ATCCCAGCAC TTTGGGAGGC CGAGGCAGGT GGATCACGAG 5700 

GTCAGGAGAT CAAGACCATC CTGGCTAACA CGGTGAAACC CTATCTCTAC TAAAAATACA 5760 

AAAAATTAGC CAGGTGTGGT GGCACGCACC AGTAGTCCCA GCTACTOGGG AGGCTQAGGC 5820 

75 AGGAGAATTG CTTGAACCTG GAAGGCAGAG ATTGCAGTGA GCTGAGACCA CACCACTGCA 5880 

CTCCATCCTG GGCAACAGAG TGAGACTTTG TCTCAAAAAG AAAGAAAGAA AGAAAGAAAG 5940 



207 
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AAAGAAAGAA AGAAAAGAAA GAAAGAAAGA AAGAAAGAAA ACAGTTCCAT TTACAATAGC 
ATC 

Seq ID NO: 59 Protein aeouence; 
Protein Accession XP 050478 
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15 



20 
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30 



35 
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MENRPGSPQY 
INGTPLYGSR 
SEAFSLSWHS 
MYPNQRDSAY 
RRTNGGHLTP 
PVVPLPQKEK 
QASTKAVGSP 
LQHVHIiDTRG 
DDRSIjVLGHQ 
SSTTKAASGT 
AQLQKSKGPL 
RARSSECLSQ 
VRGGHWRWSP 
SKSIiSTSHLP 
DQSYHSMSPL 
CIYCSGEICP 
KLTVQEFPGD 
LDLIjGDFKHA 
FSKGDETQSD 
EEEEEEEEEE 
VPABQESFAL 
SPPSGAPGIP 
ISRKIiSVIiRE 
SLSGRUUIVB 
RYLPQDQLQD 



11 

I 

VPVQLQGGAP 
QEALILIKGS 
GCNTSDVCVQ 
SSFSASSNAS 
SSQMSSRPQE 
LSLEPVLPAR 
KACDRASSVD 
SKGMELPPVQ 
SQSSPPHGEA 
EAGEEGDSEP 
SQLCDTKEPV 
APESHESRTG 
EHNSQPLVAA 
GLTTHSNKTF 
QSETPTYSEC 
ALLKRNMMPN 
KWNPITGNRK 
LKKSEBTSVY 
LLGARKKAFP 
EAEEEEEELP 
HSSOFLPPIR 
TSYSAYYNIS 
AQRGIiLEDXN 
NAUJSIDSEA 
YQHFVKMKSA 



21 
I 

WGFTLKGGIiE 
PRILKLIVRR 
WCPLSRHCST 
DCALSLRPEE 
GYQSGPAKAV 
NPNRFCCIiSG 
SNPUIEASAE 
DGHQWTIiSPIi 
DGHPSEKGFL 
KECSRMGGRR 
EETQEPPESP 
LEGRISPGQR 
AMEGPSNPGD 
TQRPKPIDQN 
FASKGLENSM 
CYNCRCHHHQ 
TSQSGREMAH 
GEGSSLASMP 
PPRPPPPNWE 
PQYFSSETSG 
GHLGSQPEQA 
VAKAELWKL 
ANSAIiGEBVE 
NQBKLVLIBK 
LIIEQREIiEE 



31 
I 

HCEPIiTVSKI 
RNAPVSRPHS 
EKSSSI6SME 
PASTDCIMQG 
RGPPQPPVRR 
HDQVTSEGHQ 
LAKASF6RPP 
HSSHKGKKSP 
DPNRTSRAAS 
SGGTRGRSIQ 
PLTASNTSLL 
PGQSSLGIiNT 
NKELKASTAQ 
FQPMSSSCRE 
CCKPLHCGDF 
CIRCSVCYHN 
SKTSFSWATP 
HPLRSRAFSE 
KYRLFRAAQQ 
SCALNPEEVL 
QPPCYYGIGG 
KDQPEMAEIG 
ANLKAVCKSN 
KQQLTGQIiAD 
KIKLGBBQIiK 



41 

I 

EDGGKAALSQ 

WHVAKIiLEGC 

SLEQPGQATY 

PGPTKAPSGR 

DSLQASRAQXt 

NCEFSQPPES 

HLI6PT6HRH 

CPPTGGTHDQ 

ELANQQPSAS 

NRRKSBRFAT 

SSCKKPPSPR 

WWKAPDPSSS 

AGEDAILLPF 

LRRHPMDQSY 

DYHRTCSYSC 

PQHSAIiEDSS 

FHPCLENPAL 

SHISLAPQST 

QKQQQQQQKQ 

EQPQPLSPGH 

LWRTSGQEAT 

LGEEEVDHBL 

BFEKYHLFVG 

AKEIiKEHVDR 

CIiRESLLLGP 



51 
I 

KMRTGDELVN 

PBAATTMHFP 

ESHLLPIDQN 

PNVABTSGGS 

liNGBQRRASE 

SQQGSEHItLM 

SAPBQIiLASH 

SSKERKTRQV 

GSLVQQATDC 

NLRNEZQRRK 

DKLFNKSMML 

DPEKAHAHCG 

ADRRKPFEES 

HSADQPYHAT 

SVQGALVHDP 

IjAPGNTWKPR 

DLSSYRAISS 

RAWGQHRREL 

QEEESEEEEE 

liEGSRQGSQS 

ESAKQEFQHF 

AQKKIQIiIES 

DLDFCWNLLIi 

REKLVFGMVS 

SNP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 60 Nucleotide aeouence; 
Nucleic Acid Accession #: NM__014 705 

Coding sequence: 192.. 24 89 (underlined sequences correspond to start and stop codons) 



GGGAGAAGCT 

ATTATATGCA 

AATGCTTCCT 

GAGCGGGAAA 

GAGAG6TTGT 

TACGCTACAT 

CATATACCCT 

TCCTGACCTA 

TCCAGAACTT 

CAGAGCAGTA 

CCTCTTTGTA 

GATTTTATGG 

ATGACTACGA 

TCGCCATGCA 

TGCAGATATA 

TTCCGGACAA 

GACCATTTCA 

GAACGTCATT 

AGCGTGAAGT 

ATCAGCAGCT 

CCCTGACTAT 

ATCAAGAGGC 

TTGCACGATT 

TGCAT6AGAA 

TC!TTTGT6AT 

CTGTCCATTT 

GCCCAGATGG 

GATATTCTTC 

AATCAGAAAG 

GTTCTACTCA 

CTCCTTTGTT 

AGAGACCATG 

ATCATATTGG 

CTTCACCAGC 



11 

I 

AGGAAAAAAT 

TGTGTTTGTA 

GGATTAGAAT 

CATGGCGGGA 

TAGATTACAG 

TCS^CAAACTC 

CCTCTTATAT 

CCCCATGCAA 

TGACAGAGGC 

TGAGAGTTAT 

TGACAAAATT 

AAAAAAATTT 

GAGGCTGGAA 

GCACGCCAAC 

TGCTGTGACT 

CATCAAAAGC 

CAAAGGCACA 

ATACTTGGTG 

GGTAGAAATG 

GAAGACTCTG 

GTGCCTGAAT 

AITCTTTGTC 

AA6AGAGCTG 

GTTTGTACCT 

6AAGTCGAGC 

TCCTAATGGA 

TACCAGG6TA 

CTCCTCACTG 

CTCT6ATGAA 

CTCGGCTTCA 

GTCTGACAAA 

CAGTGCCATC 

AGACGGGGCC 

AA6ACACACG 



21 
I 

GTCTTTGAGC 
TTTTATGACT 
TCCACTATTT 
AAGTGGCGTT 
AACTTCTATA 
TATGATCTGC 
GACGAOCTAC 
ACAGAATGGC 
AAATGTTGGG 
TATGACTACA 
ATG6ACCAGC 
CCATTTTTCT 
GCCTTCCAAC 
CAGCCCGATG 
CCCATTCCAG 
TTCTATAAAG 
AAAGATAAAG 
CAGAGTTTGC 
A6TCCTCTGG 
ATTAGTCAGT 
GGAGTTATAG 
AAAGAATATA 
ATGCTTGAGC 
CAAGATATGA 
TTAGGGATAC 
AGCCCTCGTG 
ATTCCTAGAC 
TCCTCACAAG 
GTCTTTAACA 
CCTAATGTGA 
CACAAACATT 
TATCCAACAC 
TTGCCACGCA 
ACATCA6TAT 



31 
I 

TGTGAGATGC 

TGGATAATCT 

GGTCCCTATC 

TCATTAATTG 

AGACTGAACT 

ATCTCAAAGC 

T6GAATGGTC 

A6CGCAAAGA 

AGAAT6GCAT 

GAAACCTGAG 

AACGTCTTGA 

TAAGAAATAA 

AGAGAATGCT 

AGACCATCTT 

AGAGCCAGGA 

TGAATCACAT 

AGAATGAATT 

CTGGCATCTC 

AAAATGCAAT 

GTCAGACAAG 

ATGCTGCAGT 

TCTTAAGTCA 

AGGCACAGAT 

GACCCCTTCA 

AGGAGTTCTC 

TGTGTAGAAA 

GCAGCCCGTT 

CTTCTGCTOA 

T6CA6CCAAG 

CAAGTTCTGC 

CCCGAGAAAA 

CTGTGGAGCC 

GTGACCCAAA 

CCCCCTCGCC 



41 



I 

TTGTATATTT 

GAAAATCAAT 

CTAGTCTACT 

CTACTGTAAC 

GAACAAGGAG 

ACAGAACTTT 

TGATCGGCCC 

GCACCTGCAC 

TATCTTGTGC 

CAAGAT6CGG 

ACCAGAGTTC 

GGAGTTTGTG 

GAACGAGTTC 

CCA6GCAGAA 

GGTCCTGCAG 

CTGGAAATTC 

CAAGAGTCTC 

TCGCTGGTTT 

TGAAGTGCTA 

ACAGATGCAG 

TAATGGTGGC 

CCCTGAAGAT 

TCTGGAATTT 

CAAAAAGCTG 

TGCTTGTATG 

CTCAGCACCT 

AAGTTACCCA 

AGTAA6CAAT 

TCCATCTACC 

TCCATCGA6T 

CTCTTGCCTG 

TTCGCAGAGG 

TCTCTCTGCA 

TGCCG6GCGA 



51 

I 

TGAAAATATG 

TTGCTTTGTC 

AAAGAAAATT 

TCGTCTAATG 

GAGATGTATA 

ACAGAAGCTG 

CTCAGGGAGT 

CTCACCATCA 

CGGAAGATTG 

ATGATGGAAG 

TTCAGAGTT6 

TGTCGAGGGC 

CCCCATGCCA 

GCTCAGTATT 

AGAGAGGGTG 

CGCTATGACC 

TGGGTGGAGA 

GAAGTGGAAA 

GAAAATAAGA 

AATATTAATC 

GTTTCCAGGT 

GGGGAGAAAA 

GGTTTGGCCG 

GTTGACCAAT 

CAAGCCAGTC 

GCTTCTGTGA 

GCTGTCAACC 

ATTACAGGGC 

TCAAGCTTGA 

GCCAGAGCTT 

TCACCAAGAG 

ATGCTGTTTA 

CCTGAAAAAG 

TCTCCATTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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AGGGCTCTGT GCAGTCTTTC ACCCCCTCTC CAGTGGAGTA CCACTCGCCA GGACTCATCT 2100 

CCAACTCCCC TGTCTTGTCG GGCA6CTACA GCAGTGGGAT TTCTTCTCTC AGCCGGTGCA 2160 

GCACGTCGGA AACCTCAGGC TTTGAAAATC AGGTGAATGA ACAGTCGGCC CCCCTGCCGG 222 0 

TGCCAGTGCC GGTGCCCGTG CCGAGCTACG GCGGGGAGGA GCCAGTGCGC AAGGAGAGCA 2280 

5 AGACTCCGCC CCCGTACAGC GTCTACGAGC GGACTCTGCG GCGCCCCGTC CCGCTACCTC 2340 

ACAGCCTCTC CATCCCCGTC ACGTCGGAGC CGCCCGCGCT GCCCCCCAAG CCTCTGGCAG 2400 

CGCGATCCAG CCACCTGGAG AATGGGGCCC GGAGGACTGA CCCCGGCCCG CGGCCCAGGC 2460 

CCCTGCCCCG CAAGGTCTCT CAGTT ATAA G TCACTTTTCT ATGTACCTGC GATGCATTCT 2520 

TTGCCCGTTT ACAAAATAAG AAGTATGATG AGAAGACATT TAGTGTAGGC ACTTTAATAA 2580 

10 CTTACTCAGC TCCTTCGATG AATGGAATTA AAACTTGCTT ATTAAATATC ATGTTGCACA 2640 

ATATTAAAAG TTGCTGATCT AAAACGCCAG ATGTTAAATG AAGTATGGCT GAATTTCATT 2700 

AAAACGTTTC TCATTTGGAA GTGGTAAATA GTGATAAAGA CTCCTTTTGT ACCTTTTTAT 2760 

GTTCACTTTT TTTTATATAG TTTAATCTTA AAACCAATAC GATATTGTCA AACGATACAA 2820 

TGTGTGACAA TGTTGTATCG TTTTTACTGA ATACTTGATA CTTGGAGAAA GCTTATTAAG 2880 

15 TCAGTGCACA TCCTAACACA GTGGTCCTTA TTTTAGAAGA CTTCTGTAAA TAAGG CAA GG 2940 

TTTATCAGTG CAGATCATCA GAATTAAAGT TCAAGCAGGC GAGCAAGACA GTATACTTAA 3000 

GGGGTTGCAA AGCTTGGGAC TGGAAATTGT TTTGTTCTTG AAACAAAATA CTTCTTTAAG 3060 

GTTGCTTTTG CTGTTTGACT GCTGTCTACA TTC6TAAAAT TCTATTTTGT GAATTGGTAG 3120 

CTAAATCCCT TACTACCCTG ACACCGTG6T ATCTACTGTA TTTCTTTTCA AGGTGCAATT 3180 

20 TGCTTCAGAG TTCCAATCAG CTAGATTAAG CAAGAGGCTC CAGAAGAAAT GTTTACTTGA 3240 

ATTTTGCGCT TCCTTTCTTG ATAGTTTCCT ATATAAAATT TGTCATTGAA CAAGAGCAAA 3300 

TGCTGAAGTA TTAATGAGGC ACAAATGACT GTGCCCCATT AGCAAGAATT CAGGAATCAA 3360 

TACAGACAGT ATT7UUVTTAA TAGCTTAAGT GAAGAAAAAA AAAAACTTAG TGAAAATGTA 3420 

TTAGCACGAT TAAATGGCAA AAGGACTTAT AAAAGGCAAG GGCATTAACT TTCAGTCCTG 34 8 0 

25 CACAAAATM AAAATTCCTC ACGACTCTCC ACTTTTACCA GTGGAGTTTG TCTTAGCTGA 3540 

CCTGTCGTCr TTCTCTTGAA GGAGGATTGC TGTAGACTTC TCTAGCTTGA ATATTGCAAC 3600 

ATAGCATCTT AGGTCTAGAT AGGGATGCTA ATGCCAGTTG TAGAAGTGTG AAAAAAGCAC 3660 

CTTGTATGTA GTAATGTATT TTATATCTTT GTTTTTTCTT TTACTGACTG TTTATAACAC 3720 

TCAATTGACA ATAGATATGA ACTGTATTTT AAATCATACT GTTAAATATT TTCCCTCTTT 3780 

30 TGTTGGGAAG CTCATTTTAG TTTAACCATG TTTGTTTTGT TGGTAGCTTA CCTGGAAGGC 3840 

AGTGACCACT TTTTTATATT CTCTTAATGA AACCATTCAG CAGGTATATG CTGTTGAGGC 3900 

TGGTTATAGA GGTTTTCTAT AATAAATGTT CAAGTATTTT TGTATATAAC TGOJTAATTT 3960 

TAATAAGAGA TACCATTATG TGTAAAAAAA AGTAAAAATA AACGCAAACA GTTGTTGATG 4020 

CAGTATGATT GTTATAATTA TGCCAAATAC TTTACGTATG GAAAAAGAAT ATTTGTACAT 4080 

35 ATGTGCTTTT AACAATTCTG CCATATTGAC TTTACAATTT TGAATGTCGG AAAAATTAAT 4140 

ATATGTTAAA TATTTATGTT TAGTGAAAGT GTTCATAATT GAGAAAAGGA ACATA TGCAT 4200 

TTTAGCTTTG TATCTTGCAA GTTTTGCAGT CAGAAATTTT TTGAACTAGC TTTT6CTTTT 4260 

GATAACACTT CGTGTTTGTA ACCACATTCA TATATATATA CATATATATG TGAAGCTCCA 4320 

TATTTCTGTT GCTTTAAAGA AGTAAAACCT TCCATTTAAA TAAGATGACA TGCATAAGAT 43 80 

40 AACAAAGCTT CCTTGATTTC CTTTTCCTGT GTAATTTAAT AGATTTGTTG ACTAGTGCTT 4440 

GGGCACATTA TAAATCAGTG TTATTTGCTC TTGGAGCCAT TTTTTAAAAA AAATTTTGGC 4500 

AGTGAGCAGT TGAATTTATC TTGAATTTAT CATGTGTGTG TATTTCTGAA GCAGCTACAT 4560 

AGCAGAACAT TTTAAGAGAT TCTGTTAGCC CACATGTTCA TGTTGGTTGC TGCTGAATGG 4620 
TAAATATTAA ATAAAATTAC CAGATTAATC TT 



45 



Seq ID NO: 61 Protein sequence.: 
Protein Accession #: NP 055520 



1 11 21 31 41 51 

50 1 I I I I 1 

MAGKWRFINC YCNSSNGEW R3U2NFYKTEL NKBEMYIRYI HKLYDLHLKA QNPTEAAYTIi 60 

liLYDEIiLEWS DRPLREFLTY PMQTEWQRKB HLHLTIIQMP DRGKCWENGI HiCRKIAEQY 120 

ESYYDYRNLS KMRMMEASLY DKIMDQQRIiE PEFFRVGPYG KKFPFFIiRNK EFVCRGHDYB 180 

RLEAFQQRML NEFPHAIAMQ HANQPDBTIF QAEAQYIiQIY AVTPIPESQE VLQREGVPDN 240 

55 IKSFYKVNHI WKFRYDRPFH K6TKDKENEF KSLWVERTSI* YLVQSLPGIS RWFBVEKREV 300 

VBMSPLENAI EVLENKNQQL KTLISQCQTR QMQNINPLTM CliNGVIDAAV NGGVSRYQEA 360 

FFVKEYILSH PBDGEKIARIi RELMIiEQAQI LBFGLAVHEK FVPQDMRPM KKLVDQFFVM 420 

KSSIjGIQEFS ACMQASPVHP PNGSPRVCRN SAPASVSPDG TRVIPRRSPL SYPAVNRYSS 480 

SSLSSQASAE VSNITGQSES SDEVFNMQPS PSTSSLSSTH SASPNVTSSA PSSARASPIiL 540 

60 SDKHKHSREN SCLSPRBRPC SAIYPTPVEP SQRMLFNHIG DGALPRSDPN IjSAPEKASPA 600 

RHTTSVSPSP AGRSPLKGSV QSFTPSPVEY HSPGIiISNSP VIiSGSYSSGI SSLSRCSTSE 660 

TSGFENQVNE QSAPIjPVPVP VPVPSYGGEE PVRKESKTPP PYSVYERTLR RPVPLPHSLS 720 
IPVTSEPPAIj PPKPLAARSS HLENGARRTD PGPRPRPIiPR KVSQL 

65 Seq ID NO* 62 Nucleotide sequence; 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence: 1..2561 (underlined sequences correspond to start and stop codons) 



70 1 11 21 31 41 51 

I I I I I I 

ATGG ACCX3AG GCCAGGGTAA GAGGGGCOGC GACGCCC6CA CTTGTTGOGG CGCOGGGCGG 60 

GAAAGGGAGA CTGGACGATC TGJ^CCGGA GAGGAGGAGG GAGAGAGGCG GGCGGTGGGG 120 

CGGGGGCT6A GGAACGCTCG GAGGGGACTG GGAGACGCGG CGCTTATGCA AAGGTGCCTT 180 

75 CGGCTGCCGG GACAACCCGC CAGCAACCAG GTACAGCTCT C3U5AGGTTCC ACA6AGGAAG 240 

CTCAGGGTCC CTGAATCTCC CAGTGTGGCA GAGAAAGTGA AACTTGGTCA CCGATGCCTG 300 
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GAACTGCTGG AGCAGCTGCT CCCAGAGCTC ACCGGGCTGC TCAGCCTCCT GGACCACGAG 360 

TACCTCAGCG ATACCACCCT GGAAAAGAAG ATGGCCGTGG CCTCCATCCT GCAGAGCCTG 420 

CAGCCCCTTC CAGCAAAGGA GGTCTCCTAC CTGTATGTGA ACACAGCAGA CCTCCACTCG 480 

GGGCCCAGCT TCGTGGAATC CCTCTTTGAA GAATTTGACT GTGACCTGAG TGACCTTCGG 54 0 

GACATGCCAG AGGATGATGG GGAGCCCAGC AAAGGAGCCA GCCCTGAGCT AGCCAAGAGC 600 

CCACGCCTGA GAAACGCGGC CGACCTGCCT CCACCX3CTCC CCAACAAGCC TCCCCCTGAG 660 

GACTACTATG AAGAGGCCCT TCCTCTGGGA CCCGGCAAGT CGCCTGAGTA CATCAGCTCC 720 

CACAATGGCT GCAGCCCCTC ACACTCGATT GTGGATGGCT ACTATGAGGA CGCAGACAGC 780 

AGCTACCCTG CAACCAGGGT GAACGGCGAG CTTAAGAGCT CCTATAATGA CTCTGACGCA 840 

ATGAGCAGCT CCTATGAGTC CTACGATGAA GAGGAGGAGG AAGGGAAGAG CCCGCAGCCC 900 

CGACACCAGT. GGCCCTCAGA GGAGGCCTCC ATGCACCTGG TGAGGGAATG CAGGATATGT 960 

GCCTTCCTGC TGCG6AAAAA GCGTTTCXJGG CAGTGGGCCA AGCA6CTGAC 6GTCATCAGG 1020 

GAGGACCAGC TCCTGTGTTA CAAAAGCTCC AAGGATCGGC AGCCACATCT GAGGTTGGCA 1080 

CTGGATACCT GCAGCATCAT CTACGTGCCC AAG6ACAGCC GGCACAAGAG GCACGAGCTG 1140 

CGTTTCACCC AGGGGGCTAC CGAGGTCTTG GTGCTGGCAC TGCAGAGCCG AGAGCAGGCC 1200 

GAGGAGTGGC TGAAGGTCAT CCGAGAAGTG AGCAAGCCAG TTGGGGGAGC TGAGGGAGTG 1260 

GAGGTCCCCA GATCCCCAGT CCTCCTGTGC AAGTTGGACC TGGACAA6AG GCTGTCCCAA 1320 

GAGAAGCAGA CCTCAGATTC TGACAGCGTG GGTGTGGGTG ACAACTGTTC TACCCTTGGC 1380 

CGCCGGGAGA CCTGTGATCA CGGCAAAGGG AAGAAGAGCA GCCTGGCAGA ACTGAAGGGC 144 0 

TCAATGAGCA GGGCTGCGGG CCGCAAGATC ACCCGTATCA TTGGCTTCTC CAAGAAGAAG 1500 

ACACTGGCCG ATGACCTGCA GACGTCCTCC ACCGAGGAGG AGGTTCCCTG CTGTGGCTAC 1560 

CTGAACGTGC TGGTGAACCA GGGCTGGAAG GAACGCTGGT GCCGCCTGAA GTGCAACACT 1620 

CTGTATTTCC ACAAGGATCA CATGGACCTG CGAACCCATG TGAACGCCAT CGCCCTGCAA 1680 

GGCTGTGAGG TGGCCCCGGG CTTTGGGCCC CGACACCCAT TTGCCTTCAG GATCCTGCGC 1740 

AACCGGCAGG AGGTGGCCAT CTTGGAGGCA AGCTGTTCAG AGGACATGGG TCGCTGGCTC 1800 

GGGCTGCTGC TGGTGGAGAT GGGCTCCAGA GTCACTCCGG AGGCGCTGCA CTATGACTAC 1860 

GTGGATGTGG AGACCTTAAC CAGCATCGTC AGTGCTGGGC GCAACTCCTT CCTATATGCA 1920 

AGATCCTGCC AGAATCAGTG GCCTGAGCCC CGAGTCTATG ATGATGTTCC TTATGAAAAG 1980 

ATGCAGGACX3 AGGAGCCCGA 6CGCCCCACA GGGGCCCAGG TGAAGCGTCA CGCCTCCTCC 204 0 

TGCAGTGAQA AGTCCCATCG TGTGGACCCG CAGGTCAAAG TCAAACGCCA CGCCTCCAGT 2100 

GCCAATCAAT ACAAGTATGG CAAGAACCGA GCCGAGGAGG ATGCCCGGAG GTACTTGGTA 2160 

GAAAAAGAGA AGCTGGAGAA AGAGAAAGAG ACGATTCGGA CAGAGCTGAT AGCACTGAGA 2220 

CAGGAGAAGA GGGAACTGAA GGAAGCCATT CGGAGCAGCC CAGGA6CAAA ATTAAAGGCT 2280 

CTGGAAGAAG CCGTGGCCAC CCTGGAAGCT CAGTGTCGGG CAAA6GAGGA GCGCCGGATT 234 0 

GACCTGGAGC TGAAGCTGGT GGCTGTGAAG GAGCGCTTGC AGCAGTCCCT GGCAGGAGGG 2400 

CCAGCCCTGG GGCTCTCCGT GAGCAGCAAG CCCAAGAGTG GGCAACTCTC TGAGGAA6AT 2460 

ACGCTCACCT CCAAT6GTGC TCTCTCAGAQ AGAACTTCTC TGACCTCATC TACACCAGGG 2520 
CTTCTCAACC CCAACACTAC TGACATTTTG GACCAGTAA 



Seq ID NO: 63 Protein aemienee; 

Protein Accession #: fgenesh prediction 



1 11 21 31 41 51 

I I I I I I 

MDRGQGKRGR DARTCCGAGR ERETGRSEAG EEEGERRAVG RGLRNARRGL GDAALMQRCL 60 

RLPGQPASNQ VQLSEVPQRK LRVPESPSVA BKVKLGHRCL ELLEQLIaPEL TGLLSLLDHE 12 0 

YLSDTTLEKK MAVASIIjQSL QPLPAKBVSY liWNTADLHS GPSFVESLFE EFDCDLSDLR 180 

DMPEDDGEPS K6ASPEIAKS PRLRNAADLP PPLPNKPPPB DYYEEALPLG PGKSPEYISS 24 0 

HNGCSPSHSI VDGYYBDADS SYPATRVNGE LKSSYNDSDA MSSSYESYDE EBEEGKSPQP 300 

RHQWPSEEAS MHLVRECRIC APLLRKKRPG QWAKQLTVIR BDQLLCYKSS KDRQPHLRLA 360 

LDTCSIIYVP KDSRHKRHEIi RFTQGATEVL VLAIiQSREQA BEWUCVIREV SKPVGGAEGV 420 

EVPRSPVLLC KLDLDKRLSQ EKQTSDSDSV 6VGDNCSTLG RRETCDHGRQ KKSSLAELKG 480 

SMSRAAGRKI TRIIGFSKKK TLADDLQTSS TEEEVPCCGY LNVLVNQGWK BRWCRLKCNT 540 

LYFHKDHMDL RTHVNAIALQ GCEVAPGFGP RHPPAFRILR NRQEVAILEA SCSEDMGRWL 600 

GLLLVEMGSR VTPEAIiHYDY VDVETLTSIV SAGRNSFLYA RSCQNQWPBP RVYDDVPYEK 660 

MQDEEPERPT GAQVKRHASS CSEKSHRVDP QVKVKRHASS ANQYKYGKNR AEEDARRYLV 720 

EKEKLEKEKE TIRTELIALR QEKRELKEAl RSSPGAKLKA LEEAVATLEA QCRAKEERRI 780 

DLELKIiVAVK ERLQQSIiAGG PALGLSVSSK PKSGQIjSBED TLTSNGALSE RTSIjTSSTPG 840 
LLNPNTTDIL DQ 



Seq ID NO: 64 Nucleotide aecruence : 
Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-129 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

> I I I I I 

GGCAOGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCX5 CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCX5AA AATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA A6TTGAGCA6 CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTOA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG C AGCTG TGTT ATTTCATA^ TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAA6 6AA6GAAGAA 420 



210 

.02079492A2_I_> 



wo 02/079492 



PCT/l]S02/()4915 



TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 4 80 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 65 Protein secuencej. 
Protein Accession #: NP 004117 



10 



15 



20 



25 



30 



35. 



40 



1 11 21 31 41 SI 

I I I I I I 

MPAItHXEDIiP EKEKI/KMBVB QLRKEVKIiQR QQVSKCSEEI KNYIEERSGE DPIiVKGIPBD 

KNPFKEKGSC VIS 



60 



Seq ID NO: 66 Nucleotide aeouence: 
Nucleic Acid Accession #: NM_003842.I 

Coding sequence: 1-1236 (underlined sequences correspond to start and stop codons) 



ATGGAACAAC 
GGACCCAGGG 
GTTGTCGCCG 
CTAGCTCCCC 
TGTCCACCTG 
CAGGACTATA 
TCAGGTGAAG 
GAAGGCACCT 
CCCAGAGGGA 
AAAGAATCAG 
TTTGTTTGCA 
GGTGGTGGTG 
AATGTCCTCA 
GAAGTCCAGG 
CATCTGCTGG 
AATGAAGGTG 
CCCTTTGACT 
GTGGCTAAAG 
GTCAACAAAA 
GGAGAGAGAC 
TATCTAGAAG 



11 
I 

GGGGACAGAA 
AGGCGCGGGG 
CGGTCCTGCT 
AGCAGAGAGC 
GACACCATAT 
GCACTCACTG 
TGGAGCTAAG 
TCCGGGAAGA 
TGGTCAAGGT 
GCATCATCAT 
AGTCTTTACT 
GGGACCCTGA 
ATGAGATCGT 
AGCCAGCAGA 
AACCGGCAGA 
ATCCCACTGA 
CCTGGGAGCC 
CTGAGGCAGC 
CCGGGCGAGA 
TTGCCAAGCA 
GTAATGCAGA 



21 
I 

CGCCCCGGCC 
AGCCAGGCCT 
GTTGGTCTCA 
GGCCCCACAA 
CTCAGAAGAC 
GAATGACCTC 
TCCCTGCACC 
AGATTCTCCT 
CGGTGATTGT 
AGGAGTCACA 
GTGGAAGAAA 
GCGTGTGGAC 
GAGTATCTTG 
GCCAACAGGT 
AGCTGAAAGG 
GACTCTGAGA 
GCTCATGAGG 
GGGCCACAGG 
TGCCTCTGTC 
GAAGATTGAG 
CTCTGCCATG 



31 

1 

GCTTCGGGGG 
GGGCCCCGGG 
GCTGAGTCTG 
CAAAAGAGGT 
GGTAGAGATT 
CTTTTCTGCT 
ACGACCAGAA 
GAGATGTGCC 
ACACCCTGGA 
GTTGCAGCCG 
GTCCTTCCTT 
AGAAGCTCAC 
CAGCCCACCC 
GTCAACATGT 
TCTCAGAGQA 
CAGTGCTTCG 
AAGTTGGGCC 
GACACCTTGT 
CACACCCTGC 
GACCACTTGT 
TCCTAA 



41 

t 

CCCGGAAAAG 
TCCCCAAGAC 
CTCTGATCAC 
CCAGCCCCTC 
GCATCTCCTG 
TGCGCTGCAC 
ACACAGTGTG 
GGAAGTGCCG 
GTGACATCGA 
TAGTCTTGAT 
AGCTGAAAGG 
AACGACCTGG 
AGGTCCCTGA 
TGTCCCCCGG 
GGAGGCTGCT 
ATGACTTTGC 
TCATGGACAA 
ACACGATGCT 
TGGATGCCTT 
T6AGCTCTGG 



51 
I 

GCACGGCCCA 
CCTTGTGCTC 
CCAACAAGAC 
AGAGGGATTG 
CAAATATGGA 
CAGGTGTGAT 
TCAGTGCGAA 
CACAGGGTGT 
ATGTGTCCAC 
TGTGGCTGTG 
CATCTGCTCA 
GGCTGAGGAC 
GCAGGAAAT6 
GGAGTCAGAG 
GGTTCCAGCA 
AGACTTGGTG 
TGAGATAAAG 
GATAAAGTGG 
GGAGACGCTG 
AAAGTTCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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Seq ID NO: 67 Protein sequence; 
Protein Accession #: NP 003833.1 



50 



55 



MEQRGQNAPA 
liAPQQRAAPQ 
SGEVEIiSPCT 
KESGIIIGVT 
NVIiNEIVSIIi 
NEGDPTETLR 
VNKTGRDASV 



11 
I 

ASGARKRHGP 
QKRSSPSEGIi 
TTRNTVCQCE 
VAAWIiIVAV 
QPTQVPEQm 
QCPDDFADIiV 
HTLLDAIiETL 



21 
I 

GPREARG2^RP 
CPPGHHISED 
EGTFREEDSP 
FVCKSLLWKK 
EVQEPAEPT6 
PFDSWEPLMR 
GERXiAKQKIE 



31 

I 

GPRVPKTLVIi 
GRDCISCKYG 
EMCRKCRTGC 
VIiPYIjKGICS 
VNMLSPGBSB 
KLGLMDNEIK 
DHIiLSSGKFM 



41 
I 

WAAVLUjVS 
QDYSTHWNDL 
PR6MVKVGDC 
GGGGDPERVD 
HIiIiEPAEASR 
VAKAEAAGHR 
YIiEGNADSAM 



51 

I 

AESALITQQD 
LFCIACTRCD 
TPWSDIECVH 
RSSQRPGABD 
SQRRRLLVPA 
DTIiYTMliIKW 
S 



60 
120 
180 
240 
300 
360 
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Seq ID NO: 68 Nucleotid e sequence: 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 361- 2220 (underlined sequences correspond to start and stop codons) 



65 



70 



75 



GQCACCATCT 
TGTCGCTGCC 
GGTTACACTG 
GCTGAGACGT 
TGCGAACACG 
GGTCTCAGCT 
ATGAACGGGG 
CCGCAGGACA 
TGCCAGGCTA 
AGTCTTTGTC 
GCCATCGCCT 
AACTGCTCTG 
TGTGCCCATG 



11 
I 

GCTCCCTGCC 
ACAACGGCGG 
GGGATCGGTG 
GCGACTGCGC 
GCTTCACTGG 
GCCAGGCCCC 
AGTGCTCCTG 
C6CATG66CC 
CCA6CGGCCT 
CTCCTGACAC 
GCTCACCCAT 
T6CCCTGCCC 
AGGCASTCTG 



21 
I 

CTGCCCAGAG 
CCTCTGTGAC 
CCGGGAGGAG 
CCCGGACGCC 
GGACCGCTGC 
CTGCACCTGC 
CCTGCCGGGC 
AGGGTGCCAG 
CTGTCAGTGC 
CTACGGTGTC 
0GA06GCGAG 
ACCCGGAACC 
CAGCCCCCAA 



31 

I 

GGCTTTCACG 
CGATTCACTG 
TGCCCGGTGG 
CGTTGCTTCC 
ACGGATCGCC 
GACCGGGAGC 
TGGGCGGGCC 
GAGCACTGTC 
GCGCCGGGTT 
AACTGTTCTG 
TGCGTCTGCA 
TGGGGCTTCA 
ACTGGAGCCT 



41 

i 

GACCCAACTG 
GGCAGTGCCG 
GCCGCTTTGG 
CGGCCAACGG 
TCTGCCCCGA 
ACAGCCTCAG 
TCCACTGCAA 
TCTGCCTGCA 
ACACGGGCCC 
CACGCTGCTC 
AGGAAGGTTG 
GTTGCAATGC 
OTACCTGCAC 



51 
I 

CTCCCAGGAA 
CTGCGCTCCG 
GCAGGACTGT 
CGCATGTCTG 
CGGCTTCTAC 
CTGCCACCCG 
CGAGAGCTGC 
CGGTGGCGTC 
TCACTGTGCT 
ATGTGAAAAT 
GCAGCGTGGT 
CAGCTGCCAG 
CCCTGGGTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 

NSDOCID: <WO 



wo 02/079492 PCT/U 502/04915 



CATGGGGCCC ACTGCCAGCT GCCCTGTCCG AAGGGGCAGT TTGGAGAAGG TTGTGCCAGT 840 

CGCTGTGACT GTGACCACTC TGATGGCTGT GACCCTGTTC ATGGACGCTG TCAGTGCCAG 900 

GCTGGCTGGA TGGGTGCCCG CTGCCACCTG TCCTGCCCTG AGGGCTTATG GGGAGTCAAC 960 

TGTAGCAACA CCTGCACCTG CAAGAATGGG GGCACCTGTC TCCCTGAGAA TGGCAACTGC 1020 

GTGTGTGCAC CCX3GATTCCG GGGCCCCTCC TGCCAGAGAT CCTGTCAGCC TGGCCGCTAT 1080 

GGCAAACGCT GTGTGCCCTG CAAGTGCGCT AACCACTCCT TCTGCCACCC CTCGAACGGG 1140 

ACCTGCTACT GCCTGGCTGG CTGGACAGGC CCCGACTGCT CCCAGCGCTG CCCTCTGGGG 1200 

ACATTTGGTG CTAACTGCTC CCAGCCATGC CAGTGTGGTC CTGGAGAAAA GTGCCACCCA 1260 

GAGACTGGGG CCTGTGTATG TCCCCCAGGG CACAGTGGTG CACCTTGCAG GATTGGAATC 1320 

CAGGAGCCCT TTACTGTGAT GCCX3ACCACT CCAGTAGCGT ATAACTCGCT GGGTGCAGTG 1380 

ATTGGCATTG CAGTGCTGGG GTCCCTTGTG GTAGCCCTGG TGGCACTGTT CATTGGCTAT 144 0 

CGGCACTGGC AAAAAGGCAA GGAGCACCAC CACCTGGCTG TGGCTTACAG CAGCGGGCGC 1500 

CTGGACGGCT CCGAGTATGT CATGCCAGAT GTCCCTCCGA GCTACAGTCA CTACTACTCC 1560 

AACCCCAGCT A CCAC ACCCT GTCGCAGTGC TCCCCAAACC CCCCACCCCC TAACAAGGTT 1620 

CCAGGCCCGC TCTTTGCCAG CCTGCAGAAC CCTGAGCGGC CAGGTGGGGC CCAAGGGCAT 1680 

GATAACCACA CCACCCTGCC TGCTGACTGG AAGCACCX5CC 6G6AGCCCCC TCCAGGGCCT 1740 

CTGGACAGGG GGAGCAGCCG CCTGGACCGA AGCTACAGCT ATAGCTACAG CAATGGCCCA 1800 

GGCCCATTCT ACAATAAAGG GCTCATCTCT GAAGAGGAGC TCGGGGCCAG TGTGGCTTCC 1860 

CTGAGCAGTG AGAACCCATA TGCCACCATC CGGGACCTGC CCAGCTTGCC A6GGGGCCCC 1920 

CGGGAGAGCA GCTACATGGA GATGAAAGGC CCTCCCTCAG GATCTCCCCC CAGGCAGCCT 1980 

CCTCAGTTCT GGGACAGCCA GAGGCGGCGG CAACCCCAGC CACAGAGAGA CAGTG6CACC 2040 

TACGAGCAGC CCAGCCCCCT GATCCATGAC CGAGACTCTG TGGGCTCCCA GCCCCCTCTG 2100 

CCTCCGGGCC TACCCCCCGG CCACTATGAC TCACCCAAGA ACAGCCACAT CCCTGGACAT 2160 
TATGACTTGC CTCCAGTACG GCATCCCCCA TCACCTCCAC TTCGACGCCA GGACCG TTGA 



Seq ID NO: 69 Protein secmence: 

Protein Accession # : FGENESH prediction 

} 11 21 31 41 51 

i I I I I I 

GTICSLPCPE GFHGPNCSQE CRCHNGGLCD RFTGQCRCAP GYTGDRCREE CPVGRFGQDC 60 

AETCDCAPDA RCFPANGACL CEHGFTGDRC TDRIiCPDGFY GLSCQAPCTC DREHSLSCHP 120 

MNGECSCIjPG WAGIiHCNESC PQDTHGPGCQ EHCIiCIiHGGV CQATSGLCQC APGYTGPHCA 180 

SLCPPDTYGV NCSARCSCEN AIACSPIDGE CVCKEGWQRG NCSVPCPPGT WGFSCNASCQ 240 

CAHBAVCSPQ TGACTCTPGW HGAHCQLPCP K6QPGEGCAS RCDCDHSDGC DPVHGRCQCQ 300 

AGWMGARCHL SCPEGLWGVN CSNTCTCKNG GTCLPENGNC VCAPGFRGPS CQRSCQPGRY 360 

GKRCVPCKCA NHSFCHPSNG TCYCIiAGWTG PDCSQRCPLG TPGANCSQPC QCGPGEKCHP 420 

ETGACVCPPG HSGAPCRIGI QEPFTVMPTT PVAYNSLGAV IGIAVIiGSLV VALVALFIGY 480 

RHWQKGKEHH HLAVAYSSGR LDGSEYVMPD VPPSYSHYYS NPSYHTLSQC SPNPPPPNKV 540 

PGPIiFASLQN PERPGGAQGH DNHTTIiPADW KHRREPPPGP LDRGSSRLDR SYSYSYSNGP 600 

GPFYNKGLIS EEELGASVAS LSSENPYATI RDLPSLPGGP RESSYMEMKG PPSGSPPRQP 660 

PQFm>SQRRR QPQPQRDSGT YEQPSPliIHD RDSVGSQPPIi PPGIiPPGHYD SPKNSHIPGH 720 
YDIiPPVRHPP SPPLRRQDR 



Seq ID NO: 70 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005458 

Coding sequence: 1..2826 (underlined sequences correspond to start and stop codons) 



ATGG CTTCCC 
GCGCGCCTGC 
GGCTGGGCGC 
CTCATGCCGC 
GTGGAACTGG 
CTGCGGCTCT 
ATAAAATACG 
ATCATTGCAG 
CCTGTTCTAG 
GCGGTGAATC 
CTGACGCAAG 
GGCGAGGACA 
AAAAAGCTGA 
GCAAAAGT6T 
ATTCCGGGCT 
CGCTGCCTCC 
CCCCTGAGCT 
GAGTACAACA 
GGCATCTGGG 
CGGCACCAGC 
AATGCCATGA 
GAGAGAATGG 
GAGTACAACG 
TCCGAACCAC 



11 
I 

CGCGGAGGTC 
TACTGCTACT 
GGGGCGCCCC 
TCACCAAGGA 
CCATCGAGCA 
ATGACACGGA 
GGCCGAACCA 
AGTCCCTCCA 
CCGATAAGAA 
CAGCCATTCT 
ACGTTCAGAG 
TTGAGATTTC 
AGGGGAATGA 
TCTGTTGTGC 
GGTACGAGCC 
GGAAGAATCT 
CCAAGCAGAT 
ACAAGCGGTC 
TCATCGCCAA 
GGATCCAGGA 
ACGAGACCAA 
GGACCATTAA 
CTGTGGCCGA 
CAAAAGACAA 



21 
I 

CGGGCAGCCA 
GCTGCT6CCG 
CCGGCCGCCG 
GGTGGCCAAG 
GATCCGCAAC 
GTGCGACAAC 
CTTGATGGTG 
AGGCTGGAAT 
AAAATACCCT 
GAAGTTGCTC 
GTTCTCTGAG 
AGACACCGAG 
TGTGCGGATC 
ATACGAGGAG 
TTCTTGGTGG 
GCTTGCTGCC 
CAAGACCATC 
AGGC6TGGGG 
GACACTGCAG 
CTTCAACTAC 
CTTCTTCGGG 
ATTTACTCAA 
CACACTGGAG 
GACCATCATC 



31 
I 

GGGCXSGCCX5C 
CTGCTGCTGC 
CCCA6CAGCC 
GGCAGCATCG 
GAGTCACTCC 
GCAAAAGGGT 
TTTGGAGGCG 
CTGGTGCAGC 
TATTTCTTTC 
AAGCACTACC 
GTGCGGAATG 
AGCTTCTCCA 
ATCCTTGGCC 
AACATGTATG 
GAGCAGGTGC 
ATGGAGGGCT 
TCAGGAAAGA 
CCCAGCAAGT 
A6GGCCAT6G 
ACX36ACCACA 
GTCACGGGTC 
TTTCAAGACA 
ATCATCAATG 
CTGGAGCAGC 



41 

I 

C6CC6CCGCC 
CTCTGGCGCC 
CGCCGCTCTC 
GGCGCGGTGT 
TGCGCCCCTA 
TGAAAGCCTT 
TCTGTCCATC 
TTTCTTTTGC 
GGACCGTCCC 
AGTGGAAGCG 
ACCTGACTGG 
ACGATCCCTG 
AGTTTGACCA 
GTAGTAAATA 
ACACGGAAGC 
ACATTGGCX3T 
CTCCACAGCA 
TCCACGGGTA 
A6ACACTGCA 
CGCTGGGCA6 
AAGTTGTATT 
6CAGG6AGGT 
ACACCATCAG 
TGCGGAAGAT 



51 




1 

ACCGCCGCCC 


60 


CGGG6CCTGG 


120 


CATCATGGGC 


180 


GCTCCCCGCC 


240 


CTTCCTCGAC 


300 


CTACGATGCA 


360 


CGTCACATCC 


420 


TGCAACCACG 


480 


ATCAGACAAT 


540 


CGTGGGCACG 


600 


AGTTCTGTAT 


660 


TACCAGTGTC 


720 


GAATATGGCA 


780 


TCAGTGGATC 


840 


CAACTCATCC 


900 


GGATTTCX3AG 


960 


GTATGAGAGA 


1020 


CGCCTACX3AT 


1080 


TGCCAGCAGC 


1140 


GATCATCCTC 


1200 


CCGGAAT6GG 


1260 


GAAGGTGGGA 


1320 


GTTCCAAGGA 


1380 


CTCCCTACCT 


1440 



212 

.02079482A2.i.> 



wo 
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CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 
TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 
2\ACCTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 
GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 
5 ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 
ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 
GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 
CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 

10 TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 
GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 
ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGT6CAG 
TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 
GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 

15 CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 
AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 
AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 
GAAAAGACCA CCTACATTAA ACA6AACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 
GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 

20 AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 
GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 
CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 
CCCACCGCCA 6CCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 
CTG TAA 

25 

Seq ID NO: 71 Protein sequence;. 
Protein Accession #: NP_005449 

30 1 11 21 31 41 51 

I I I I I I 

MASPRRSGQP GRPPPPPPPP ARIiIiLLIiLLP LI^IiPIAPGAW GWARGAPRPP PSSPPliSlMG 
LMPLTKEVAK GSIGRGVIiPA VELAIBQIRN ESLLRPYFIiD IiRLYDTECDN AKGIiKAFYDA 
IKYGPNHLMV FGGVCPSVTS IIABSLQGWN LVQLSPAATT PVLADKKKYP YFFRTVPSDN 

35 AVNPAIIiKIjIi KHYQWKRVGT LTQDVQRFSE VRNDLTGVI.Y GBDIEISI>TB SFSNDPCTSV 
KKXiKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IP6WYEPSWW EQVHTEANSS 
RCIiRKNLIiAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER BYNNKRSGVG PSKPHGYAYD 
GIWVIAKTIiQ RAMETIiHASS RHQRIQDFNY TDHTLGRIIIj NAMNETNPPG VTGQWFRNG 
ERMGTIKFTQ FQDSREVKVG BYNAVADTLB IINDTIRFQG SEPPKDKTII LBQLRKISIiP 

40 LYSIl*SAIiTI LGMIMASAPL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 
GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MPAKTWRVHA IFKNVKMKKK IIKDQKIjLVI 
VGGMLLIDI.C IIjICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLiLEHCEN THMTIWLGIV 
YAYKGLLMLF GCFIiAWETRN VSIPAIiNDSK YIGMSVYNVG IMCIIGAAVS FliTRDQPNVQ 
FCIVALVIIF CSTITLCIiVF VPKLITIiRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 

45 NQASTSRLEG LQSENHRIjRM KITEIiDKDLE EVTMQLQDTP EKTTYIKQNH YQEIiNDII*NL 
GNFTESTDGG KAILKNHIiDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RIiSLQIiPILH 
HAYIiPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 72 Nucleotide sequence: 
50 Nucleic Acid Accession #: NM_005795 

Coding sequence: 522-1940 (underlined sequences correspond to start and atop codons) 

1 11 21 31 41 51 

t I I I I I 

55 GCACGAGGGA ACAACCTCTC TCTCTSCA6C AGAGAGTGTC ACCTCCTGCT TTAG GACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 24 0 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 3 00 

60 GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 420 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT T ATGT TATAC AGCATATTTC 54 0 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

65 TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 66 0 

TTACTAGAAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

70 CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020 

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AA6TT6CCAA AGGATTACCT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTCGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

75 AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCCQTOT TTGCAGAGAA 6CAACATTTA ATGTGGTATT 1320 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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2640 
2700 
2760 
2820 
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10 



15 



20 



25 



30 



35 



40 



45 



ATTTTCTTGG 
TATATTACAA 
GCCCAATTTG 
TCATCACCAA 
GAGCTACTCT 
CTGAAGGAAA 
AGGGTCTTTT 
GAAGAAACTG 
TTCGTAGTGC 
GTCCTAGTGA 
CAGAAAATTT 
AACTCAAGGA 
GGGAATGTCA 
ATCCAGCTCT 
CACTATGCCT 
ACAATCAACT 
AAATGGCTGT 
GACCTAGCTA 
TCCCATCTTG 
TAACTACCCT 
CTATGAAAAG 
ATCTTGTGGC 
TTCTATATCA 
TGTCTTACCA 
TTCTACTGTA 
TATTTTCTTG 
TTTTATTTTA 
TAATGCAACA 
AAATA6A6TC 
AGAAGTCTGT 
AAACACAAGG 
GGTGTGTTTG 
TTTTAAACAA 
AATATTACTT 
ATTCAAGAAA 
TTTAATAGAG 
AAAATTATTG 
AACCTTGCTA 
CCGCTGAAAT 
GTCATGGGTA 
CATCATAAGT 
TCAAATTGCT 
AGTACATTTA 
ATA6CAAGCT 



CTGGGGATTT 
TGACAATTGC 
TGCTGCTTTA 
GTTAAAAGTT 
TATCTTGGTG 
GATTGCAGAG 
GGTCTCTACC 
GAATCAATAC 
GTCTTACACA 
ACACTTAAAT 
ATATAA TTGA 
CTTGGACCCA 
TAAAGAAGAG 
ATGTGGGAAA 
GATGTGACGC 
TTTCTGAGCT 
AAAACTAAAC 
AGGTCTATAA 
ATTGGGGCAG 
CTCAAATGGA 
CAACTGAGTA 
ATATCCATTG 
TTAGGAAAAC 
AACAGTGGGA 
TAAACAAATT 
GAATTTTGTA 
TAGTCTCAAA 
ATGTGTGTAT 
TGGAATGCTA 
TTGAGAACTA 
TCACTATTTT 
ACATATTTCT 
CTACTGTGAT 
TACTGACTTT 
TATAAAAAAC 
CTACTGTATA 
TCAGATCTTA 
ATGAATTAAA 
CTCTAAAGAA 
TCTGTTTTTT 
TGTTTCTTAA 
AAGACAAATT 
TAATTTATCT 
GCCATAGAAA 



CCACTGATTC 
TGGATCAGTT 
CTGGTGAATC 
ACACACCAAG 
CCATTGCTTG 
GAGGTATATG 
ATTTTCTGCT 
AAAATCCAAT 
GTGTCAACAA 
GGAAAAAGCA 
AAATAGAAG6 
TGACTCTGTA 
CCTTCACATG 
AAAGAAATCC 
TACTAACCTG 
GGTGTAAGCC 
ATACATGTT6 
ACATGAAGGG 
TTGACTTTTT 
CAATACCAGA 
CAATTGTTAT 
TGGAAACTGG 
ATCTTAGTTG 
GGGAATTCCT 
AGCAATCATT 
AAAAGAAATT 
TCAAATACAT 
GTTAATATCT 
TATTTGGTAA 
AGAGAACA6A 
ACTGAATATA 
TTTTTCATTT 
AAATACCAAT 
TACTATGT6A 
TAGAAGGATA 
TAATACAAAT 
CTGAATTATT 
GTGAAATTTG 
CAAGAATGAC 
AAGTGTGTCA 
GCTGTCAATA 
ATCTAAATTC 
ATGCATGAAA 
6GA 



CTGCTTGTAT 
CTGATACCCA 
TTTTTTTCTT 
CGGAATCCAA 
GCATTGAATT 
ACTACATCAT 
TCTTTAATGG 
TTGGAAACAG 
TCAGTGATGG 
TCCATGATAT 
ATGGTTGTCT 
GCCAGAAGAC 
AAATTAGTAG 
TGGTTTGTAA 
ACATCACCAA 
AGTTCCAGCA 
GGCATGATTC 
AAAATTAGCT 
TTTTTTCCCA 
AGTGAATTAT 
GATCTACTCA 
ATGAACAGGA 
ATGCTACAAA 
AGCTGTAAAT 
TTATATAAAG 
GTGAAAAATG 
ACAACCTATG 
GATACTGTAT 
ATATTTTAAA 
AACATCTATC 
TTTGTTTTGA 
TGACAATGAA 
CTGCTACTTT 
AGATATATAG 
CTATATATAC 
TAGGGAAATA 
GTCAGACTTT 
CATG6GATTC 
TTCAATTAGT 
ATCTGATTAA 
TGTCAATAGA 
GTAAGAATTA 
AAGTATTGTT 



ACATGCCATT 
TCTCCTCTAC 
GTTAAATATT 
TCTGTACATG 
TGTGCTGATT 
GCACATCCTT 
AGAGGTTCAA 
CTTTTCCAAC 
TCCAGGTTAT 
TGAAAATGTT 
CACTGTTTGG 
TTCAATATTA 
TGTGTTGATA 
TGTTTGTCAG 
GTGTGGAATT 
CACCATTGAT 
TACCCTTATT 
TTTAGTTTTA 
GAGTGCCXSTA 
CCCTGCTGGC 
TTTGCTGACA 
TGTATAATAT 
ACACCTTGTC 
ATAAATTTTG 
AAAATCAATG 
AGCTTGTAAA 
TAATTTTTJ^A 
CTGGGCTGAT 
GACAACCAGA 
ATAAGATATA 
TAACTCATAC 
CTCACATTCT 
TATAGATTTT 
CTTTGGAAAT 
CATATACAAT 
CTTGAATATA 
ATTAAATAAA 
AGTTTCTCTA 
AAAAGTCAAT 
AATGGATGAA 
TGGTGAGTTC 
ACATATAGAA 
TTGTTTGAAA 



GCTAGAAGCT 

ATTATCCATG 

GTACGCGTTC 

AAAGCTGTGA 

CCATGGCGAC 

ATGCACTTCC 

GCAATTCTGA 

TCAGAAGCTC 

AGTCATGACT 

CTCTTAAAAC 

TGCTTCTCCT 

AATGACTTTG 

AGAGTGTAAC 

TAAATACTCC 

GGAGAAAAGC 

GAATTCAAAC 

CSCCCCAAGA 

AAACTCTTTA 

GTCCTTTTTG 

TTTCTTTTCT 

CATCAGTTAT 

GCAATCTTAC 

AACCTCTTCC 

TCCCTTCCAT 

AAGGATTTCT 

TACTCCATTA 

AGCAAATATA 

TTTTTAAATA 

TGCCAGCATC 

TTTATTTTAA 

CTTAATAATA 

AATCCAGAAA 

ACCCCATTAA 

6TCCCAGGCT 

GCTTTAATAT 

TCATTGAGAA 

6ATAGAAGAA 

ATGTTATTTT 

TTTGGGAAAA 

ACAAATTACT 

AGAACTTATT 

TGGTCTGGTC 

CATGAATTTC 



1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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2040 
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2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2860 

2940 
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Seq ID NO: 73 Profcein sequence; 
Protein Accession #: NM 005795 



50 



55 



60 



65 



70 



75 



MIjYSIFHIiGL 
QKIMQDPIQQ 
NWFRHPASNR 
SCQRITLHKN 
MliCEGIYLHT 
liliYIIHGPIC 
VLIPWRPEGK 
FSNSEALRSA 



11 
I 

MMEKKCTLYF 
AEGVYCNRTW 
TWTNYTQCNV 
LFPSFVCNSV 
LIWAVFAEK 
AALLVNIiFFL 
lABEVYDYIM 
SYTVSTISDG 



21 

1 

LVLLPFFMIL 
DGWIiCWNDVA 
NTHEKVKTAL 
VTIIHLTAVA 
QHLMWYYFLG 
LNIVRVLITK 
HII^FQGLL 
P6YSHDCPSB 



31 
I 

VTAELBBSPE 
AGTESMQLCP 
NLFYZiTIIGH 
NNQALVATNP 
WGFPLIPACI 
LKVTHQAESN 
VSTIFCFPNG 
HXiNGKSZHDI 



41 
I 

DSIQLGVTRN 
DYFQDFDPSE 
GLSIASLIiIS 
VSCKVSQPIH 
HAIARSLYYN 
LYMKAVRATL 
EVQAIZiRRNW 
ElTVUiKPENIf 



51 
I 

KIMTAQYECY 60 

KVTKICOQDG 120 

LGIFFYFKSIi 180 

LYIiMGCNYFW 240 

DNCWISSDTH 300 

ILVPLLGIEF 360 

NQYKIQFGNS 420 
YN 



Seq ID NO: 74 Nucleotide aecruenee ; 
Nucleic Acid Accession NM_000450.1 

Coding sequence: 117.. 1949 (underlined sequences correspond to start and stop codons) 



CCTGAGACAG 
CCAAAACGGA 
TTGCTTCACA 
GGTCTTACAA 
AAAGGTACAC 
TATTGAGCTA 
TCTGGGTAGG 
CCAACAATAG 
TGGGCATGTG 
CCTGTACCAA 



11 
I 

AGGCAGCAGT 
AAGTATTTCA 
GTTTCTCTCA 
CACCTCCACG 
ACACCT6GTT 
TTCACCAAGT 
AACCCAGAAA 
GCAAAAAGAT 
GAATGATGAG 
TACATCCTGC 



21 



I 



GATACCCACC 
AGCCTAAACC 
GCTCTCACTT 
6AA6CTATGA 
GCAATTCAAA 
TATTACT6GA 
CCTCTGACAG 
GAG6ACTGCG 
AGGTGCAGCA 
AGT6GCCACG 



31 

I 

TGAGAGATCC 
TTTGGGTGAA 
TGGTGCTTCT 
CTTATGATGA 
ACAAA6AAGA 
TTGGAATCAG 
AAGAA6CCAA 
TGGAGATCTA 
AGAAGAA6CT 
GTGAATGTGT 



41 

I 

TGTGTTTGAA 
AAGAACTCTT 
CATTAAAGAG 
GGCCAGTGCT 
GATTGAGTAC 
AAAAGTCAAC 
GAACT6G6CT 
CATCAAGAGA 
TGCCCTATGC 
AGAGACCATC 



51 
I 

CAACTGCTTC 
GAAGT CATG A 
AGTGGAGCCT 
TATTGTCAGC 
CTAAACTCCA 
AATGTGTGGG 
CCAGGTGAAC 
GAAAAAGATG 
TACACAGCTG 
AATAATTACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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CTTGCAAGTG 

CCCTGGAATC 

ACAATTCTTC 

TGCAGTGTAT 

6TGATGCT6T 

TCCCATGGAA 

AGAGCCTTCA 

TGACATGCAG 

CTGGAGAGTT 

AGGGACCAGC 

GTGAAGCTTT 

CTAGTGCTTC 

TTGTGTTG7VA 

AGCCCACATG 

GGTGTGCTCA 

AGGAGGGATT 

CAGAAGAGGT 

TCAACATGAG 

AAGGATG6AC 

GCCTGCTACC 

CTGCTGCTGG 

TACGGAAA6C 

GCTACCAAAA 

GGAACTAGAG 

CTTCTTGCCT 

AATAGATCAA 

TCCTACTCTC 

CAAAGGTGAA 

CGCTGTAAAA 

CAGTGTTTCG 

GTTTAGAGGA 

CCCACAGGCA 

AGGGTTACTA 

GTTCTTAAGG 

TCATTCAATA 

GGTAAAAGTT 

TGTTTTAAAT 

A6AATAAAGT 

AGTAAAAACT 

CCACGATGAA 

CACACAAAGG 

GCTTTGCATT 

TATGGAAGAT 

TTTAACGAAT 

GCTCTGGAAG 

AACAATTCCA 

AGTAATTGCC 

CCATTAACTT 

AACGACAAAG 

TTAAAGGGGC 

ATGGAATACA 

GCATTAGAAA 

TTTAAATTAT 

TCAGACCTAT 



TGACCCTGGC 
CCCTGAGCAT 
CTGCTCTATC 
GTCCTCTGGA 
GACAAATCCA 
CACAACCTGT 
GTGTACCTCA 
GGCCGTCCGC 
CACCTTCAAA 
CCAGGTTGAA 
CCAGTGCACA 
TGGCAGTTTC 
GGGATCCAAA 
TGAAGCTGTG 
TTCCCCTATT 
TGAATTATAT 
TCCTTCCTGC 
CTGCAGTGGG 
GCTCAATGGC 
TACCTGTGAA 
ACTCTCCCTC 
AAAGAAATTT 
GCCTTCTTAC 
GGATACACTG 
ACTATGCCAG 
AGTCCAGCAG 
AGGATCAAGA 
GAGACCAAGA 
TCTTGGCACA 
ACAGCTGATT 
AAAAAATGAC 
AATGCATGGA 
TGCACAATTT 
ATTGTGATAT 
CAAGTGTGGT 
ACTTATTCTA 
TTATTTCAAA 
TCTGAGTGTT 
GAATGGAAGG 
AAACTTCCAT 
TATAATTTTA 
CCTACAAAGA 
TTTAAATTCA 
GAAGATGTCT 
AGAGGAATGC 
AAGGAATCTC 
AAAGCTGCTC 
AGCATGTGTT 
CCAACAGTCA 
AGAAAAACTC 
GTGTTATTTT 
TTAGCTGTGT 
AACTTAAAAT 
TTGACATAAC 



TTCAGTGGAC 

GGAAGCCTG6 

AGCTGTGATA 

GAATGGAGT6 

GCCAATGGGT 

ACATTTGACT 

TCTGGGAATT 

CAGCCTCAGA 

TCATCCTGCA 

TGCACCACTC 

GCCTTGTCCA 

CGTTATGGGT 

AGGCTCCAAT 

AGATGCGATG 

GGAGAATTCA 

GGATCAACTC 

CAAGTGGTAA 

GAGCCCGTGT 

TCTGCAGCTC 

GCTCCCACTG 

CTGACATTAG 

GTTCCTGCCA 

ATCCTTTAAG 

AAGTTAACAG 

ATGCCTTTAT 

GCAAGGACGG 

AAGTGTTGGC 

CTCTGAAATC 

GAAACACAAT 

ACACAGTTGC 

TAAAAATATT 

GGGTTGTTAA 

AATCACTTTC 

TTTTACTTGC 

AGGGACTTAA 

GATTACCCCC 

AAGGGAAAGT 

TTTACTACAG 

TTTGTATATT 

GAGGCCAAAC 

TGAATGTCTT 

TGTTTGTCAG 

CAATAGAAAC 

AATAGTTATT 

CTGTGTGAGC 

CAGTTTTCAG 

TAGCCTTGAG 

GAAAAAAAAA 

AAACAGAGAT 

TGGGAAATAA 

CTTTGT^AATT 

GAAATACCAG 

ATTTTATAAT 

ACTATAAA6G 



TCAAGTGTGA 

TTTGCAGTCA 

GGGGTTACCT 

CTCCTATTCC 

TCGTGGAATG 

GTGAAGAAGG 

GGGACAACGA 

ATGGCTCTGT 

ACTTCACCTG 

AAGGGCAGTG 

ACCCCGAGCG 

CCAGCTGTGA 

GTGGCCCCAC 

CTGTCCACCA 

CCTACAAGTC 

AACTTGAGTG 

AATGTTCJ^AG 

TTGGCACTGT 

GGACATGTGG 

AGTCCAACAT 

CACCATTTCT 

GCAGCTGCCA 

TTCAAAAGAA 

AGACAGATAA 

GGCTGAAACC 

CCTTCAACTG 

TAATGAAGGG 

TCAGAATTCC 

ATTTTGTGGC 

TGTCATAAGA 

ATAACTTAAA 

TGGTGCAAAT 

ATCCCTATGG 

ATTGAATATA 

AATVACTTGTA 

TCATTGTTTA 

ATTGTCCCCT 

TTGTTTTTTG 

GTCAGATATT 

GTTTTGAACT 

T6TTGGAAAA 

AT6TGATATG 

TCACCATGTA 

CCCTATTTGT 

AAGCATTTAT 

TTGATCACTG 

GAGTGTGAGA 

GTTTCAGAGA 

GTGATAAGGA 

GAGAGAACAA 

GTTTAAGTGT 

TGTGGTTTGT 

TTTT7U\AGTA 

TTGACAATAA 



GCAAATTGTG 

CCCACTGGGA 

GCCAAGCAGC 

AGCCTGCAAT 

TTTCCAAAAC 

ATTTGAACTA 

GAAGCCAACG 

GAGGTGCAGC 

TGAGGAAGGC 

GACACAGCAA 

AGGCTACATG 

GTTCTCCTGT 

AGGGGA6TGG 

GCCCCCGAAG 

CTCTTGTGCC 

CACATCTCAG 

CCTGGCAGTT 

GTGCAAGTTC 

A6CCACAG6A 

TCCCTTGGTA 

CCTCTGGCTT 

AA6CCTTGAA 

TCAGAAACA6 

CTCTCCTCGG 

GCAACACCCA 

AAAAGACTCA 

AAAGGATATT 

TTTTCTAACT 

TTTCTTTCTT 

ATGAATAATA 

AAAATGACAG 

CCTACTGAAT 

GATTCAGTGC 

TTATAATCTT 

AATGCTGTCA 

TTAACAAATT 

A6CAAGGCAT 

AAAACATGGT 

TTTTCAGAAA 

AATAAAAGCA 

GAATACAGAA 

TAAACATAAT 

AAAGAGTCAT 

TTTCTTCTGT 

GTTTATTTAT 

GCAATGAAAA 

ATCAAAACTC 

AGTTCTGGCT 

TCAGAACAGC 

CTACTGTGAT 

TGTAAATATT 

GTTTGAGTTT 

TATATTTATT 

ATGTGCTTAT 



AACTGTACAG 

AACTTCAGCT 

ATGGA6ACCA 

GTGGTTGAGT 

CCTGGAAGCT 

ATGGGAGCCC 

TGTAAAGCTG 

CATTCCCCTG 

TTCATGTTGC 

ATCCCAGTTT 

AATTGTCTTC 

GAGCAGGGTT 

GACAACGAGA 

GGTTTGGTGA 

TTCAGCTGTG 

GGACAAT6GA 

CCGGGAAAGA 

GCCTGTCCTG 

CACTGGTCTG 

GCTGGACTTT 

CGGAAATGCT 

TCAGACGGAA 

GTGCATCTGG 

GTCTCTGGCC 

TCACCACTTC 

GTGTTCCCTT 

TTCTTCCAAG 

CTCCCTTGCT 

TTGCCCTTCA 

ATTATCCAGA 

ATGTTGAATG 

GCTCTGTGCG 

TTCTTAAAGA 

CCATACTTCT 

ACTATGATAT 

ATGTTACATC 

GATGTTAACC 

AGAATTGGAG 

TATGTGGTTT 

TAAATGCAAA 

AGATGGATGT 

TCTTGTATAT 

CTGGTAGATT 

ATGTTAGGGT 

AAGCAGATTT 

ATTCTCAGTC 

TCCTACACTT 

GAACACTGGC 

AGAGGTTCTT 

CAGGCTATGT 

TATGTAAACT 

TATT6AGAAT 

TAAGCTTATG 

GTTT 



660 
720 
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Seq ID NO: 75 Protein aecruence; 
Protein Accession #: MP 000441 



11 



21 



31 



41 



51 



MIASQFLSAIi 
SILSYSPSYY 
DVGMWNDERC 
TALES PEHGS 
ECDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCEAVRC 
WTEEVPSCQV 
SGLIiPTCBAP 
GSYQKPSYlIi 



TLVIiLIKESG 
WIGIRKVNNV 
SKKKLAIiCYT 
liVCSHPLGNF 
GFVECFQNPG 
QNGSVRCSHS 
SNPERGYMNC 
DAVHQPPKGL 
VKCSSIiAVPG 
TESNIPLVAG 



AWSYNTSTEA 
WVWVGTQKPL 
AACTNTSCSG 
SYNSSCSISC 
SFPWNTTCTF 
PAGEFTFKSS 
IiPSASGSFRY 
VRCAHSPIGE 
KINMSCSGEP 
LSAAGIiSLLT 



MTYDEASAYC 
TEEAKNWAPG 
HGECVETINN 
DRGYIiPSSME 
DCEEGFELMG 
CNFTCEEGFM 
GSSCEFSCEQ 
FTYKSSCAFS 
VFGTVCKFAC 
LAPFLIjWIiRK 



QQRYTHLVAI 
EPNNRQKDED 
YTCKCDPGPS 
TMQCMSSGEW 
AQSbQCTSSG 
LQGPAQVECT 
GFVLKGSKRI, 
CEEGFELYGS 
PBGHTLNGSA 
CLRKAKKPVP 



QNKEEIEYLN 
CVEIYIKREK 
GIiKCEQIVNC 
SAPIPACNW 
NWDNEKPTCK 
TQGQWTQQIP 
QCGPTGEWDN 
TQLECTSQGQ 
ARTCX3AT6HW 
ASSCQSLESO 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 76 Nucleotide sequence: 
Nucleic Acid Accession #: NM_031439 

Coding sequence: 69.. 1235 (underlined sequences correspond to start and stop codons) 
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10 



15 



20 



25 



30 



35 



40 



1 
I 

CCCGACCCGT 
GTGCGGCCAT 
CCCTGGACGC 
ACAAGGGCTC 
ACX3AGAG6AA 
TGCTGGGAAA 
CGGAGCGGCT 
GGAAGAAGCA 
TCTCCCGGGA 
AGAAGGAGGA 
ACCACGAGGG 
CGTACGGGCT 
CCTTCTTCTC 
CAGGGCACCC 
GCTCCCTGGC 
CCCCATCTCC 
CCCACCTGGG 
TGAGCCAGGT 
CTCCTGGCCA 
CCCAGGTGAC 
CGGCCACGTA 
CCCTCGCGCC 
TTTCCTCCCA 
GCTGGACTCT 
GCCCACATTT 
CTCCCAGTGG 
GACAGACTTG 
ATTAATAAAG 
TGATAATTTT 
TCCAAAGTGA 
GATTT6AGAA 
TATTTTATTT 
ACACACACTT 
TATAAAATTC 
GATGTTTAAA 



11 
I 

GCX3AGGGCCA 

GGCTTCGCTG 

CGAGCTGTCG 

CGAGAGCCGT 

ACGGCTGGCA 

GTCGT6QAAG 

GCGCCTGCAG 

6GCCAAGCGG 

CCAGAACGCC 

CAGGGGTGAG 

GCCGGCTGGT 

GCCCACACCT 

CTCCCCCTGC 

GTACTCACCG 

CCTTGGCCAG 

TGCCTATTAC 

CCAGCTTTCC 

GGAACTCCTG 

CCCAGACTCC 

ACCAACGGGT 

CTACAACAGC 

CTCTCCTTCT 

CCGCTCAGGG 

CCTTATCCGA 

TAAGTATATT 

AATGTTCACT 

ATAGCCAAGG 

GAAGATGGG6 

GTGTGCACAG 

CCACAAAATT 

ATTAACCAGT 

TAAATATACA 

CAAGAGCCAC 

AGTGTATTAG 

AACAAAACAG 



21 

I 

GGTCCGCGCC 
CTGGGAGCCT 
GATGGACAAT 
ATCCGGCGGC 
GTGCAGAACC 
GCGCTGACGC 
CACATGCAGG 
CTGTGCAAGC 
CTGCCGGAGA 
TACTCCCCCG 
GGTGGC6GCG 
CCTGAAATGT 
CA6GAGGAGC 
GAGTACGCCC 
TCCCCCGGCG 
TCCCCGGCCA 
CCGCCTCCTG 
GGGGACATGG 
GCCACAGGGG 
CCCACAGAGA 
TACAGTGTGT 
TGTGCCTTGA 
CAGGGAGGTC 
GTGCCGCCTC 
CCTTCAAGTG 
GACGTCTTTT 
TCCCTTCT6G 
AAATTTGACX. 
CCCAAGGACC 
TCAAAGGGAC 
ATGGCTAACT 
TTTTAAAGCA 
CGOGCCCAGC 
TTTCATTACA 
GCTGTTGTAA 



31 
I 

TGCCCCGCCA 
ACCCTTGGCC 
CGCCGCCGGC 
CCATGAACX3C 
CGGACCTGCA 
TGTCCCAGAA 
ACTACCCCAA 
GCGTGGACCC 
AGAGAAGCX3G 
GCACTGCCCT 
GCGGCACCCC 
CTCCCCTGGA 
ATGGCCATCC 
CAA6CCCTCT 
TCTCCATGAT 
CCTACCACCC 
AGCACCCTGG 
ATCGCAATGA 
CCATGGCCCT 
CCAGCCTCAT 
C ATAGA GCTG 
GTGGCAGAGG 
TGAACTGCGG 
TATCCCCTTC 
AGTTTTCCTC 
CTTGGTAGCC 
TCCAGTTTTC 
CATTAATGAG 
ACGAGGCTTT 
TCATACAATT 
ATATCACAGA 
GTTCTTTTTT 
CTACATTTAT 
TAGGAGAAAT 
AAAAAAAAAA 



41 
I 

GGCGAAGCGA 
CGAGGGTCTC 
CGTCCCCCGG 
CTTCATGGTT 
CAACGCCGAG 
GAGGCCGTAC 
CTACAAGTAC 
GGGCTTCCTT 
CAGCCGGGGG 
GCCCAGCCTC 
GAGCAGTGTG 
CGTGCTGGAG 
CCGCCGCATC 
CCACTGTAGC 
GTCCCCTGTA 
ACTCCACTCC 
CTTCGACGCC 
ATTCGACCAG 
CAGTGGGCAT 
CTCCGTCCTG 
GAGGCGCCCC 
AGCCGTCCAG 
CCCCAGAGCC 
CCCACGTTCC 
CAGCCCCTGA 
ATCATCGAAA 
TGATTTAGGG 
CTCGCTAACC 
CTGCACTTTC 
TGAGAAAAAA 
AAATG6GATT 
TTTGTTAATT 
AATTTTCATT 
TATATTTCTA 
AAAAAAAAA 



51 

I 

GGCGACCCGC 
GAGTGCCCGG 
CCCCCGGGGG 
TGGGCCAAGG 
CTCAGCAAGA 
6TGGACGAGG 
CGGCCGCGCA 
CTGAGCTCCC 
GCGCTGGGGG 
CGGGGCTGCT 
GACACGTACC 
CCGGAGCAGA 
CCCCACCTGC 
CACCCCCTGG 
CXCGGCTGTC 
AACCTCCAAG 
CT6GATCAAC 
TATTTGAACA 
GTTCCGGTCT 
GCTGATGCCA 
GTCCGGTCAG 
CCACACCAGC 
TTTGGCCTAA 
AGCCCCTGCA 
GAGTTGCTGT 
CTAATGGGGG 
TTCTCTCAAG 
TACGATCTGG 
TGCACCCCCT 
CAGTCAACCT 
GA6TTAAAAC 
TGTTTATTAT 
CTCTTTTACC 
AACATTTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620. 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



45 



55 



60 



Seq ID NO: 77 Protein aeouence! 
Protein Accession #: NP 113627 



11 21 31 41 51 



MASLLGAYPW PEGLECPALD AELSDGQSPP AVPRPPGDKG SBSRIRRPMN AFMVWAKDER 60 

KRIiAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRIiQHMQDYP NYKYRPRRKK 12 0 

JV QAKRLCKRVD PGFLLSSLSR DQNALPEKRS GSRGALGBKE DRGEYSPGTA LPSLRGCYHE 180 

GPAGGGGGGT PSSVDTYPYG LPTPPEMSPL DVLBPEQTPF SSPCQEEHGH PRRIPHLPGH 24 0 

PYSPEYAPSP LHCSHPLGSL ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

GQLSPPPEHP GFDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVLADATAT YYNSYSVS 



Seq ID NO: 78 Nucleotide sequence! 
Nucleic Acid Accession #: XM_035787 

Coding sequence: 329.. 949 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

III III 

TGCCCCGCCC CGCTCCCCAG CGCCCCGGAA GTGATCTGTG GCGGCTGCTG CAGAGCCGCC 60 

AGGAGGAGGG TGGATCTCCC CAGAGCAAAG CGTCGGAGTC CTCCTCCTCC TTCTCCTCCT 120 

OD CCTCCTCCTC CTCCTCCAGC CGCCCAGGCT CCCCCGCCAC CCGTCAGACT CCTCCTTCGA 180 

CCGCTCCCGG CGCGGGGCCT TCCAGGCGAC AAGGACCGAG TACCCTCCGG CCGGAGCCAC 240 

GCAGCCGCGG CTTCC6GAGC CCTCGGGGCG GCGGACTGGC TCGCGGTGCA GATTCTTCTT 300 

AATCCTTTGG TGAAAACTGA GACACAAAAT fiGCTGCAAAT AAGCCCAAGG GTCAGAATTC 360 

TTTG6CTTTA CACAAAGTCA TCATGGTGGG CAGTGGTGGC GTGGGCAAGT CAGCTCTGAC 4 20 

/U TCTACAGTTC ATGTACGATG AGTTTGTGGA GGACTATGAG CCTACCAAAG CAGACAGCTA 4 80 

TCGGAAQAAG GTAGTGCTAG ATGGGGAGGA A6TCCAGATC GATATCTTAG ATACAGCTGG 540 

GCAGGAGGAC TACGCTGCAA TTAGAGACAA CTACTTCOGA AGTG66GA6G GGTTCCTCTG 600 

TGTTTTCTCT ATTACAGAAA TGGAATCCTT TGCAGCTACA GCTGACTTCA GG6A6CAGAT 660 

TTTAAGAGTA AAAGAAGATG AGAATGTTCC ATTTCTACTO GTTGGTAACA AATCAGATTT 720 

AGAAGATAAA A6ACAGGTTT CTQTAGAAGA GGCAAAAAAC AGAGCT6A6C AGTGGAATGT 780 

TAACTACGTG GAAACATCTG CTAAAACACG A6CTAATGTT 6ACAAGGTAT TTTTTQATTT 840 
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AATGAGAGAA 
GAG6AAAAGT 
ACTCCTTTCT 
TTAATTGACT 
TTGGAACTGC 
AGCAAAGTTC 
AAGCTTGTGT 
TGGGAGGAAA 
AAATGTCTTG 
CCACTTTTTT 
CATATAGTTA 
TTAGAAAAGT 
GAAAAATATC 
CCATGTGTCC 
AAGTAACTGT 
AAAATAAGAT 
AGGGCCAGCA 
TTTGAATGAA 
GTGAGCATGT 
AGTGCTAATG 
ATTTCTAAAT 
TTTCTGTTTC 
CCATCAAACC 
TCACCATTGT 
CAGTGGGAAA 
AGTTGATTCA 
TGTGTTCAGG 
ACTCACTCTT 
TGAGATGTAC 
TACCAATGTA 
TTTAACCAGG 
GGTACTGAGG 
ACTACCCGAT 



ATTCGAGCGA 
TTAGCCAAGA 
TATCTTGACC 
GAAATTACTT 
AATGAAAGTC 
AACTTATTTC 
TTCTTGGGCA 
GGTGACTTCC 
GTCTTCTACT 
TAACCATTAT 
AACTGAGAGT 
GGTGTAAACT 
TGGTTATATC 
TGGTGAGAAA 
CCGCTAGAAG 
TCCAGAGCTC 
TATATACTTG 
CCCTCCTTTT 
AAGTGTTAAG 
CATTTTGCAC 
TTATATTCAT 
TGTTTATAAT 
TGG6TATAGT 
GTGGTGTACC 
TATGCCACTG 
ACAAAGTATT 
CATTCCAGGT 
TAGATATTTA 
ATCTTTCATT 
ACACTGGCCA 
GGTCCTAACC 
TGCTATGAAG 
TTGTTTATTT 



GAAAGATGGA 
GAATCAGAGA 
ATACTAATAA 
TAACATTTTG 
AAATTTACTT 
ATAATTGCCT 
GTCTTTCTTG 
TCTGGTGTTT 
GCCTTGAAAA 
TATGCAAAAT 
AATTCATCTG 
TGTACATGGA 
ATTCTGGGTG 
ATATATGCCT 
TCTGTCCAAA 
TTTGATCGCT 
CAAGATAATT 
CTCTGAGATT 
TTTTTAATCT 
TA6AACGCTT 
AAAGTTACAG 
GAAGAACACT 
GCAGAAAAC6 
TGCTGGAAGA 
ACCGATTTTT 
TTTTTCTTTT 
AACAGGTGTG 
CATCCAGCTT 
TCGTATTTCT 
GCGGGCCCAG 
ACTAACATTG 
CCAACTGACA 
GCTVATTTGAG 



AGACAGCAAA 
AAGATGCTGC 
ATATAATTTA 
GAAATTGTTG 
TAAAAAQAAA 
ACATTTATCA 
AAATTGAAGA 
ATTATAAAGC 
ATGACAATTG 
TTAGAA6AAA 
TGAATCTGCT 
ATTTTTTGAA 
TGTTCTTACT 
GGCACAGCTT 
TTTAAAATGT 
TTTAATAAAC 
TTCAGCTGCA 
CTGGTCCCTG 
GGGA6CAGG6 
CG66AAAATA 
TTTGATACAG 
GTAGCTACAT 
TGGCACACAC 
ATTCTAGCAT 
TTTTTTTCCT 
TTCTCAGTCC 
TATGTAAAGT 
CTCATGTTAA 
CATAGGCTAT 
CAATCTCCAT 
TGACTTTGCT 
AAGATGCATC 
CCATTTAAAG 



6AAAAGAATG 
ATTTTATAAT 
TAAGCATTGC 
TATATCACTA 
TTAATATGGC 
TGGTCCTGAA 
GGTGAAATGG 
TTAAATTTTA 
TGAACATGAT 
AGTTATTGGC 
TTAATTACCT 
TATGCCTTAA 
GACACCAGGG 
TTGTATAGAA 
GTGCCATATT 
TGCAAGTTCA 
AGGATTCAGC 
GAAATCCCTT 
CATAGGAA6A 
TTCATGCTTG 
GAATTATTAG 
TTTCAGAAGT 
TGACCACACA 
GCTACTTGGG 
CTTTGCAGTG 
TAATTTGAAC 
TAAAAATAGG 
ATATTTGTCC 
GCCATGTGCG 
GTGTACTTAT 
TTGAGACCTT 
ACGTGTCTTA 
ACCAATAAAC 



GAAAAAA6AA 
CAAAGCCCAA 

cattgaaggc 
aaagcatg;^ 
ttcaccaaga 

TGTAGCGTGT 
GGGTGGGGAG 
TATCATTTTA 
AGTTAAACTA 
ATGGTTGTTG 
GGTGAGTAAC 
TTTAGAAACT 
GTCCGCTGCC 
AATTCTTGAG 
CTGGTTCTTG 
TTTTA7ATGA 
ACCAGTTATG 
TCTGCTAGTG 
AAATGTCA6T 
CCATCTGTTC 
GAGTAATTCT 
TAACATCAAG 
TTAGGCTGTG 
GACATAATTT 
GGGCTAGGAC 
AGGTCAAAGA 
CTTTTTAGGA 
TTAAAGGGTT 
GAATTCAAGT 
TACAGTCTTA 
TCCTCTCCTG 
GGCTGATGCC 
TTCCTTTTTT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID NO: 79 Protein sequence; 
Protein Accession #: XP_035787 

1 11 21 31 41 51 

I I I I t I 

MAANKPKGQN SIiALHKVIMV GSGGVGKSAI* TLQFMYDEPV BDYEPTKADS YRKKWLDGB 
EVQIDIIiDTA GQEDYAAIRD NYFRSGEGFIi CVFSITEMES FAATADFREQ ILRVKEDENV 
PFLIjVGNKSD IiEDKRQVSVE EAKNRAEQWN VNYVETSAKT RANVDKVFPD LMREIRARKM 
EDSKEKNGKK KRKSIjAKRIR ERCCIL 



60 
120 
180 



Seq ID NO: 80 Nucleotide sequence ; 
Nucleic Acid Accession #: NM_003467 

Coding sequence; 8 9.. 114 7 (underlined sequences correspond to start and stop codous) 



GTTTGTTGGC 
CACCGCATCT 
CTACACCGAG 
AGAAAATGCr 
TGGCATTGTG 
CATGACGGAC 
TCCCTTCTGG 
AGTCCATGTC 
TCTGGACCGC 
GGCTGAAAAG 
CTTCATCTTT 
CAATGACTTG 
TGGTATTGTC 
CCACCAGAAG 
TTGGCTGCCT 
GCAAGGGTGT 
TTTCTTCCAC 
CTCTGCCCAG 
AGGAAAGCGA 
CAGCTAACAC 
ACATTTTTCA 
TTGTCTTGTG 
TGTTTCATAT 
CTCGTGGTAG 
AAGCTA6AAA 



11 

I 

TGCGGCAGCA 
GGA6AACCAG 
GAAATGGGCT 
AATTTCAATA 
GGCAATGGAT 
AAGTACAGGC 
GCAGTTGATG 
ATCTACACAG 
TACCTGGCCA 
GTGGTCTATG 
GCCAACGTCA 
TGGGTGGTTG 
ATCCTGTCCT 
CGCAAGGCCC 
TACTACATTG 
GAGTTTGAGA 
TGTTGTCTGA 
CACGCACTCA 
GGTGGACATT 
AGATGTAAAA 
GATATAAAAG 
TTTCTTTAGT 
TGATGTGTGT 
GACTGTA6AA 
TGATCCCCAG 



2X 

I 

GGTA6CAAAG 
CGGTTACCAT 
CAGGGGACTA 
AAATCTTCCT 
TGGTCATCCT 
TGCACCTGTC 
CCGTGGCAAA 
TCAACCTCTA 
TCGTCCACGC 
TTGGCGTCTG 
GTGAGGCAGA 
TGTTCCAGTT 
GCTATTGCAT 
TCAAGACCAC 
GGATCAGCAT 
ACACTGTGCA 
ACCCCATCCT 
CCTCTGTGAG 
CATCTGTTTC 
GACTTTTTTT 
ACTGACCAAT 
TTTTGTGAAG 
CTAGGCAG6A 
AAGGGAACT6 
CTGTTTATGC 



31 
I 

TGACGCCGAG 
GGA6GGGATC 
TGACTCCATG 
GCCCACCATC 
GGTCATGGGT 
AGTGGCCGAC 
CTGGTACTTT 
CAGCAGTGTC 
CACCAACAGT 
GATCCCTGCC 
TGACAGATAT 
TCAGCACATC 
TATCATCTCC 
AGTCATCCTC 
CGACTCCTTC 
CAAGTGGATT 
CTATGCTTTC 
CAGA6GGTCC 
CACTGAGTCT 
TATACGATAA 
ATTGTACAGT 
TTTAATTGAC 
CCTGTGGCCA 
AACATTCCAG 
ATAGATAATC 



41 

I 

GGCCTGAGTG 
AGTATATACA 
AAGGAACCCT 
TACTCCATCA 
TACCAGAAGA 
CTCCTCTTTG 
GGGAACTTCC 
CTCATCCTGG 
CAGAGGCCAA 
CTCCTGCTGA 
ATCTGTGACC 
ATGGTTGGCC 
AAGCTGTCAC 
ATCCTGGCTT 
ATCCTCCTGG 
TCCATCACCG 
CTTGGAGCCA 
AGCCTCAAGA 
GAGTCTTCAA 
ATAACTTTTT 
TTTTATTGCT 
TTATTTATAT 
AGTTCTTAGT 
AGCGTGTAGT 
TCTCX».TTCC 



51 
I 

CTCCAGTAGC 
CTTCAGATAA 
GTTTCCGTGA 
TCTTCTTAAC 
AACTGAGAAG 
TCATCACGCT 
TATGCAAGGC 
CCTTCATCAG 
GGAAGCTGTT 
CTATTCCCGA 
GCTTCTACCC 
TTATCCTGCC 
ACTCCAAGGG 
TCTTCGCCTG 
AAATCATCAA 
AGGCCCTAGC 
AATTTAAAAC 
TCCTCTCCAA 
GTTTTCACTC 
TTTAAGTTAC 
TGTTGGATTT 
AAATTTTTTT 
TGCTGTAT6T 
GAATCACGTA 
C6TGGAACGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



217 



wo 02/079492 



PCT/US02/04915 



TTTTCCTGTT CTTAAGACGT GATTTTGCTG TAGAAGATGG CACTTATAAC CAAAGCCCAA 1560 
AGTGGTATAG AAATGCTGGT TTTTCAGTTT TCAGGAGTGG GTTGATTTCA GCACCTACAG 1620 
TGTACAGTCT TGTATTAAGT TGTTAATAAA AGTACATGTT AAACTTACTT AGTGTTATG 

Seq ID NO: 81 Protein secruence : 
Protein Accession #: NP 003456 



10 



15 



MEGISIYTSD 
LVMGYQKKLR 
YSSVLILAFI 
DDRYICDRPY 
TVILIIiAFFA 
LYAFLGAKFK 



11 
I 

NYTBEMGSGD 
SMTDKYRIiHL 
SLDRYIiAIVH 
PNDLWVWFQ 
OTLPYYIGIS 
TSAQHALTSV 



21 
I 

YDSMKEPCFR 
SVADIiLFViT 
ATNSQRPRKL 
PQHIMVGIjII, 
IDSFILLBII 
SRGSSLKIIiS 



31 
I 

EENANFNKIF 
LPFWAVDAVA 
LAEKWYVGV 
PGIVIIiSCYC 
KQGCEPE29TV 
K6KRG6HSSV 



41 

I 

liPTIYSIIFL 
NWYFGNFLCK 
WIPALXiLTXP 
IIISKLSHSK 
HKWISJTEAL 
STESBSSSFH 



51 

I 

TGIVGNGLVI 
AVHVIYTVNL 
DPIFANVSEA 
GHQKRKAIiKT 
AFPHCCLNPI 
SS 



60 
120 
180 
240 
300 
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30 
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40 



45 



50 



55 



60 
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70 



75 



Seq ID NO: 82 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014959 

Coding sequence: 314.. 1609 (underlined sequences correspond to start and stop codons) 



1 

I 

CTGGTTCTCA 
GGGAAACAAG 
GTGCAAGCAT 
ATTTGACAAA 
CC6AGACGGG 
TTTTTCCAAG 
TGTCAACTAT 
TTCCTCTTAT 
TCTGGGGCCT 
CGTTTGGTTC 
AAGGGATGAG 
CCTGCAGCAC 
AGAGGAGGCT 
CGTCTCCTGG 
CCGGGTGGAG 
GCTGCGGATC 
TTATCACCCC 
GCTAACAAAG 
GCCCCCAATG 
GAAAGTAATG 
CTCAAAATTC 
ACATGGGACT 
ATCAGCCCCT 
AGCCAGGATG 
TGA6AAT6AG 
GCTGAGCATG 
TGAAAGG6AC 
TAGGTAGTCT 
TCATTGATTT 
AGGACAGACT 
AATGTCTGAA 
CACTATTTTA 
GTGTTTATGA 
CCTATGGACA 
AAGCTTCTCA 
GTTAGGACTT 
TGAAAGAGAG 
GTGTGTAGCC 
CTGA6ATGAA 
CAGCTTATTC 
ACATTTGGAT 
ATGTTTTCGT 
TAGTA6ACAC 
A6TCTCTTTT 
ATTTATCCTC 
CCACTTAAAG 
TAGGATATAG 
ATATATTTTC 
TTATGTTTTT 
GTTCTGCAAA 
TAGGTACAGA 
TCAGTGATGT 



11 



ACTTCTTTTG 
CTCTCAGGAC 
CTGGGCCATC 
TGGAAAAAAA 
TATACAGGGA 
AA GATGA TGA 
CT6GGGGGGA 
GCTTCTAAAG 
GAAGGAAATG 
CCCACTGCTG 
GTCACAGTGA 
CATGAACAGT 
GTCGCCGAAA 
TTTCTCGTTG 
CCTTTCTATG 
GCCAGTGGGA 
CACCCCGAAG 
GCGATAGATG 
GAACCCCTGA 
CCCAAGGAGT 
TATGCTGGGC 
TTGGTGTGGG 
CCTCCTTTCT 
GGGGACCTGA 
AAGGAGCTGG 
6T6GAGAAGA 
CCTTACCTCG 
GGAAGAGAGA 
CAGTGTTCAA 
TGTAACCTGG 
GAAGGTAGTA 
TCCATTGACA 
AGGATGGGGC 
CTCCGTTTGA 
GCTGAGGACA 
TAACACTTTA 
CACAAAAATG 
TGCTACAAGG 
ATGTGGTAAA 
TGAGACAGAC 
TCTACCATAG 
GGCACACATA 
CATCTGGTTG 
AACTGGAAGA 
AGTCAGCCAG 
T6GAATCTAG 
TTATCTTCTA 
TACTTTTAAT 
CCAGATATTT 
TTTCCACTTC 
TTCTCTTATT 
TTTTTGCTTG 



21 
I 

AAATAATGTT 
TTCCGGTCGC 
TTCAATGGTA 
GGAGTGTCCA 
GCTACCCTGT 
GACAGAGGCA 
CATTCCCAGG 
TCTGTTTT6A 
TGGATGTTGA 
GCTGGTATCT 
CGATTGCGTT 
GGCTGGTGGG 
TCCACCTCCC 
CCCATTTTAA 
CTGTCCTGGA 
CTCGCCTCTC 
ATATTAAGTT 
ATGAGGAAGA 
ACTTTGGTTC 
TGAAATTGTC 
AGATGAAGGA 
ATACTGAGGT 
CAGGTGCAGC 
AAGGGGT6CT 
T66AGCA6GA 
AAGGGGACCT 
TGTCCTATCT 
ATCCAGCGTT 
GACAGAAGAA 
CATGTACCTA 
ATATTCCTTT 
TGATTCTTGA 
CTGGAAAGGC 
AGTATCACCT 
CTCAAGGCAT 
TCTATGGCTA 
GGAGAAAATG 
AGTTGTTGGG 
TCAACTCCAC 
ATTCCTGGCA 
ACTCTGTCAT 
TTTCCATCCT 
AGTCAGTTTT 
TTTCAATTAC 
TTTGTTATGT 
6CACTTTATC 
CATAATCTTT 
CACTCAGAAG 
ACCATTTCTG 
TTCTGATAGA 
TTTTGCTTCC 
TAGTATTTTT 



31 



CATAGAGAAG 
CATGATGGCT 
AAAAAGATAC 
GAAAAGAGTA 
GTTTCTGAGA 
GAGCCATTAT 
AGACATTTGC 
GATCGAAGAA 
GTTGATTGAT 
GTGGTCAGCC 
TGGTTCCTGG 
C6GCCCCTTG 
CCACTTCATC 
GAATGAAGGG 
AAGCCCCAGC 
CATCCCCATC 
CCACTTGTAC 
TCGCTTCCAT 
CAGTTATATT 
CTACAGGAGC 
ACCCATTCAA 
GAAGCCAGTG 
CTTTGTGAAG 
CGATGATCTC 
AAAGACACGG 
GGCCCTGGAC 
TAGACAGCAG 
CTCATTGGAA 
GACTGGGTAA 
TTGACTGTAT 
TAAATTTTTT 
AGACCCAGGA 
AACTTTTCCT 
TCTCATAACT 
ACATGATGAC 
CTGTTATTAG 
CAAACATGAG 
TTAAATGTTC 
AGAACCACCA 
ATGTACCATA 
TTTGTAGCCA 
TTTATGTTTA 
TTTTATGGTG 
TTAOGTTAAT 
CTTTTCTATT 
ACCATTTAGA 
CTGTATCTTA 
ATTTAAAAAA 
TTGCTCTTCC 
CGTTTTTTAG 
TCTGA6GACA 
AGTTGACATT 



41 

I 

GAGGGCTGTC 
GTGGGCGGTA 
AGTAAAGACA 
GCAGCAGTGA 
CCCTTTGTGA 
TGTTCCGTGC 
TCAGAAGAGA 
GATTATAAAA 
AAGAGCACAA 
ACAGGCCTCG 
AGTCAGCACC 
TTTGATGTCA 
TCCCTCCAAG 
ATGGTCCTGG 
TTCTCTCTGA 
ACTTCCAACA 
CTTGTCCCCA 
GGTGTGCGCC 
GTGTCTAATT 
CCTGGAGAAA 
CTTGAGATTA 
GATCTCCAGC 
GAGAACCACC 
CAGGACAATG 
CAGAGCAAGA 
GTGCTCTTCA 
AATTTG TAAA 
ATGGATAAAC 
CATCTATCAC 
CCTCATGCAT 
CCAACCATTG 
TAAAGGACAT 
GATTAATGTQ 
AAAAGCAGAA 
AGTCTTTTTT 
AACAATGTAA 
CAGAAAATAT 
ATGGTCAACT 
AAAAGAAAAT 
CAAAAAATAA 
TTTCAGCTGT 
ATCTGTTTAA 
TATTTTGAAC 
GTAATTATTA 
CTACTGTTAT 
TCCTATTACC 
AAACCCATCA 
CTCATGAGAA 
TTCATTATTT 
TTCTTTTAGA 
TCTTTTTCTC 



51 



TGAGATTCGA 
AACGCGGTTA 
TAAATACCAC 
GGAAGAGCTG 
CATCTCACAT 
TGTTCCTGAG 
ATCAAATAGT 
ATCGTCAGTT 
ACAGATACAG 
GCTTCCTGGT 
TGGCCCTGGA 
CTGCAGA6CC 
GTGAGGTGGA 
AGCATCCAGC 
TGGGCATCCT 
CATTGATCTA 
GCGACGCCTT 
TGCAGACTTC 
CTGCTAACCT 
TTCAGCACTT 
CTGAAAAAAG 
TTGTAGCTGC 
GGCAACTCCA 
AGGTTCTTAC 
ATGAGGCCTT 
GAAGCATTAG 
ATGAGTCAGT 
AGAAATGTGA 
ACAGGCTTTC 
TTTCCTCAAG 
CTTGATATAT 
CCGGATAGGT 
AAAAATAATT 
AAGCTAACAA 
TTTTTTGTAT 
ATGTATTTGC 
TTTCCCACTG 
CCAAGGAATA 
GAGGGTAATT 
GCCAACTCTG 
CTTTTGATTA 
AACAAGTTCC 
CCATTCTGAT 
ATATGTTAGG 
CACATTTGTA 
TTTTCTCATC 
ATAAATTATT 
GAGTAATCTG 
TCCAAATTTC 
GTGGTTCTGA 
ACCTTCATTC 
CAGCAGTTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



218 



NSDOCID: <WO 0207949aA2.l > 



wo 02/079492 



PCT/US02/04915 



CTTTTAGCTT CCGTATTTCC TGATGAGAAA TCTGCAGTCA TTCAAATTGT TGTTTCCCTG 
TATGTAGTGT GTCATTTTTC TGTCAGATTT CAAGGTATTT ATCTTTAGTT TTTAGCCATT 
TCATTATGTT GGGGATGAGT TTCCTTGTTT TATTCCCTTT GGAATTTGCT CCAATTCATA 
AATTT6CAGT TTTATGTCTT TTACCAAACT TAGAGGTTTT CAGCCTAATT TCTAAAAATA 
5 CTTTTTATTA GCCTGATTTT CATCTTTATA GGAAATAGTT TAAGTGATGA CAAGTTCCAA 
TAGCTTATAT GCCCAGAAGG CCTTCAAAAT AAGAATTTTG AAAG/UVTACA GAAAACAAAC 
TTTTATATCC TTCTCATGTC TTCTACTGTA AAATTCATAT GCTTTGCTAC TCTAAACCTA 
GTTTGAAATC AACAGTCTTG AGAATAGATG AAAATTTTGA TGAATAGTGG AATTCTTTTA 
AATGGAAACC TCTTACATGT GATTTTCCTT GCCATCTAGA AATAAACCAT AGTATTTATG 

10 TTGAATCAAT CAATATTATA TTTTGTTTTT TTCCTCCTCT TCTGAGACTC TTATTGTGGA 
AATGTTAGAC TTTTATGTTT TCCTAAATGT CCCTGATATT CTACTTATTT AGAACATCTT 
TTCATTTTTT CCATTATTCT GATTGGGTAA TTTTAATTTG TCTATTTTCA AATTTGCTGG 
AGTGTTCACC TGTTGTTGTC TGTGTCGTCC CACTGAGTGC ATTCACCACC TTTTAAATTT 
TGGTCACTGT ATGTATCAGT TCTAAAATTT CCATTTTGTT CTCTATATTT TAAATTTCTT 

15 GGCTTATATT CTATTTTCCT GCAAATGTGT CAGCATTTGC TTGTTTGAGC TTTTTTTTTT 
TCAAGACAGG GTCTCAACTC TGTTACCCAG GCTGGAGTGC AGTGGTGCGA TCTCAGCTCA 
CTGCAACCTC TGCCTCCTGG TTCAAGCGAT TATTGTGCCT CAGCCTCCTG AGTAGCTGGG 
ATTACAGGCA TGCACCACCA CAGCCCAGCT AATTTTTTGT ATTTTTAGTA GAGACAGAGT 
TTTGCTATGT TGGCCAGGCT GGTTTTGAAC TCCTGGCCTC AA6TGATCCA CCCACCTCAG 

20 CCTCCCAAAG TGCTGGGATT ACAGGCCACT ACACCTGGCA CATTTGAGTA TTTTTTTTTT 
TTTTTTTTTT TTGAGAT6GA GTCTCGCTCT GTCATCTAGG CTGGAGTGCA GTGGTGTGAT 
CTCAGCTCAC TGCAGCCTCT GTCTCCCGGG CTCAAGCGAT TCTCTTGCCT CAGCCTCCTG 
AGTAGCTAGG ACTACAGGTG CATGCCAACA CGCCCGGCTA ATTTTTTTAA AAAATATTTT 
TAGTAGAGAC AGGGTTTCAC CATTTTGGCC AGGATGGTCT CGATCTCCTG ACCTCATGAT 

25 CCACCCGCCT CGGCCTTCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCG TGCCTGGCCT 
CATTTGAGTA TTTTTATAAT GTCTCTTTTA AAGTCTTTGT CAGATAATTC CACTGTACAT 
GTTATTCAGT GTTTGGTGTC CACTGAGTTG TCATTTGCCA GACAAGTGGA GATTTTTGCA 
GCTCATCCTT GTATTCTCAG TAGTTCCGAT ATGTACCCTC GACATGTGAA TGTTATCTTA 
TGAGACTCTG TTTTATTTGT ATCCAACAGA AGATGTTTAT TATTTATTTG GCTTTCTGTG 

30 AACTGAGGTC TTAATATCAG CTCATTTTAA AAGTCTTTGC AGTGGTATTC GGATCTATCC 
TGTGTGTGCC TATGAGATTG GGTGCAGTGT ATCCTGTTAG CTCCATTCTC AGGGCGTTTG 
AATGTGAATT AGGACCAGCG CAATGAATGC TCAAGTTGGG GTTGGGCGTT AGAATTCATA 
AAAGTCTTTA TATGCTCAG 

35 

Seq ID NO: 83 Protein secaience; 
Protein Accession #: NP__055774 



40 1 11 21 31 41 51 

I I I i I I 

MMRQRQSHYC SVLFLSVNYL GGTFPGDICS EENQIVSSYA SKVCFEIEED YKNRQFLGPE 
GNVDVELIDK STNRYSVWFP TAGWYLWSAT GLGFLVRDEV TVTIAFGSWS QHLALDLQHH 
EQWLVGGPLF DVTAEPEEAV AEIKLPHFIS LQGEVDVSWF LVAHFKNEGM VIiEHPARVEP 
45 FYAVLESPSP SIiMGIUjRIA SGTRLSIPIT SNTIiIYYHPH PEDIKFHLYL VPSDALLTKA 
IDDEEDRFHG VRLQTSPPME PLNFGSSYIV SNSANLKVMP KELKLSYRSP GEIQHPSKFY 
AGQMKEPIQL EITEKRHGTL VWDTEVKPVD LQLVAASAPP PFSGAAFVKE NHRQLQARMG 
DIiKGVIiDDIjQ DNEVIiTBNEK EIjVEQEKTRQ SKNEALIiSMV EKKGDLAIiDV IiPRSISBRDP 
YLVSYIiRQQN Ii 

50 

Seq ID NO: 84 Nucleotide sequence : 
Nucleic Acid Accession #: NM_007036 
Coding sequence; 56-610 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I i 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60 

GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

60 TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCCGCG 180 

CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 24 0 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTG 420 

65 TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480 

TTCCAACAGA TTTGTTTCTC TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 540 

GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600 

TCCACGCTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA 660 

CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACAT6TAAT TTTTGAAGAT 720 

70 CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATAT6A CTGAACACTT 6TATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT 6TATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CC6CAGTGA6 TCAAATTAGT 960 

TCTTTGACTT TGATCTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020 

75 GATGGGGAGG GGGTGGGAGT GG6AAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080 

TCTAGAATTT AATTGTGCTT lT T TTl'' rX '' I''r TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140 
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3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 



60 
120 
180 
240 
300 
360 
420 



wo 02/079492 



PCT/US02/049I5 



ACCAGAAAAC CCCTGAAGGA AGTAAGATGT 
AGCTTTGAAC T6AGAGCAAT TTCAAAAGGC 
TGAAGGACGG TTCTGGGGCA TAGGAAACAC 
CCACCTCAGA GATAAATCTA AGAAGTATTT 

5 GTAAATATTT ATATATTTTT ATAAATAAAT 
ATTTATCATC CTCTTGAGGA AAGAAATCTA 
AACCTATGAC TCTATAAGGT TTTCAAACAT 
TATAGGAGTC ACTCTG6ATT TCAAAAAATG 
TAAACATAAG TGCTGTGACT TCG6T6AATT 

U GGAGGTTTGT AAAAGAAGAA TCAATTTTCA 
TGGAATTAGG AGTATATTTG AAAGAATCTT 
CTTAGGAAAT ATCTCAGAAG TATTTTATTT 
AGTATTTACC TGTATTTTAT TCTTGAAGTT 
GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT 

5 TTGTTCAATA AAAAAGAACA AQATAC 



TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200 

TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260 

ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

TACCCACTGG TGGTTTGTGT GTGTATGAAG 13 80 

GTGTTAGTGC AAGTCATCTT CCCTACCCAT 1440 

GTATTATTTG TTGAAAATGG TTAGAATAAA 1500 

CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TCAAAAAATG AGCAACAGAG GGACCTTATT 1620 

TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GCAGAAAACA TGTCAACTTT AAAATATAGG 1740 

AGCACAAACA GGACTGTTGT ACTAGAT6TT 1800 

GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

GGCCAACAQA GTTGTGAAT6 TGTGTGGAAG 1920 

AGGTTTTGTT TTAAAAG6AC ATGTTTATTA 1980 
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Seq ID NO: 85 Protein semience: 
Protein Accession #: np 008967.1 
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I I I r 

MKSVLLIiTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS PRCKRTVUDD CGCCRVCAAG 60 

RGETCYRTVS GMDGMKCGPG LRCQPSNGED PPGEEFGICK DCPYGTFGMD CRETCNCQSG 120 

ICDRGTGKCL KFPFFQYSVT KSSNRPVSLT EHDMASGDGN IVREEWKEN AAGSPVMRKW 180 
LNPR 
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Seq ID NO: 86 Nucleoti de sequence; 
Nucleic Acid Accession #: D86983 

Coding sequence: 52-4491 (underlined sequences correspond to start and stop codons) 



1 
I 

AGCCGGCCGT 
CGCTCCAGGG 
ACGCTGGCCG 
CGCACCACCG 
ACCTCCATCC 
CGGCTGAGGA 
GGAGCATTTG 
TCAATTGACA 
AATCAGATAG 
TTTTTGCATA 
ATGAAGA6AT 
GCGGATTTGC 
TATCCCAGAC 
GAAAGGCCCC 
TACTTCACCT 
AATGAGCTGA 
ATCCAGAACA 
GGAGAGGTGA 
TTTGTAATCC 
AGCGCCACAG 
CCAGTTGACC 
CAGGGGGACA 
ACCGCTTTCA 
GTTATTGAGG 
ATCGCCTGGA 
TCGGGAACAC 
GCTGTCAACA 
ACCCCA6TGT 
CTCCCGTGCA 
CAGGTGACAG 
GTTGGCCCTG 
TCGGTGAGCA 
GTAGCTACCT 
ACACATTTGT 
CCGAGGGATC 
CAGCTCATTC 
CACTACAACG 
ACCGCCCACC 
CACGACGGCA 
GAGCGCCTGC 
CACCGACTGT 
GG6ACGGAGA 



11 

I 

GGTGGCTCCG 
GCCCCGGGCG 
TGGTGGCCCA 
TGCGCTGCAT 
TAGATCTTCG 
ACTTGAACAC 
AAGACTTGGA 
GGCAAGCATT 
AAACTTTGGA 
ACAACCGGAT 
TGCGACTGGA 
TGAAAACCTA 
GCATCCAGGQ 
GGATCACCTC 
GCAGAGCCGA 
GCATGAAGAC 
CACAGGAGAC 
AGACGCAAGA 
AGCCACAGAA 
GCCACCCCCC 
CGCGGGTGAA 
GCGGAGAGTA 
TCATCGTCCA 
GCCAGACCGT 
CCAAGGGAGG 
TTAGAATCTC 
TCATCGGCTC 
TTGCCAGCAT 
GCTCCCAGGG 
AAAGTGGAAA 
CAGACGCAGG 
TGGTGCTCAG 
CCATCGTGGA 
TTGACAGCCG 
CTTACACAGT 
AGGAGCATGT 
ACCTGGTGTC 
GGCGCGTGAA 
CCTGTAACAA 
TGAAATCCGT 
ACAACGGGCA 
CC6TCACACC 



21 
I 

TGCGTCCGAG 
CCGCTGCCTG 
GAAGCCGGGC 
GCATCTGCTG 
CTTTAACAGA 
ATTGCTTCTC 
AAATTTAAAA 
TAAGGGACTT 
CCCAGATTCG 
TACACATTTA 
CTCAAACACA 
CGCGGAGTC6 
ACGCTCAGTG 
CGA6CCCCAG 
AGGCAACCCC 
AGATTCCCGC 
AGACCAG6GT 
GGTGACCCTC 
TACAGAGGTG 
GCCGCGGATC 
CATCACGCCT 
TGCGTGCTCT 
GGCTCTTCCT 
GGATTTCCAG 
GAGCCAGCTC 
TGGTGTTGCC 
CCAGAAGGTC 
TCCCAGCGAC 
CGAGCCCGAG 
ATTTCACATC 
TCXjCTATGAG 
TGTGAACGTT 
AGCGATTGCG 
TCCTCX3TTCT 
TGAACA6GCA 
ACAGCATGGC 
TCCACAGTAC 
CAACTGCTC6 
CCTGCAGCAC 
GTACGAGAAT 
CGCCCTTCCC 
C6ACGAGCAG 



31 



CGTCCGTCCG 
TTGGCGCTCG 
GCAGGGTGTC 
CTGGAGGCCG 
ATCAGAGAGA 
AATAATAATC 
TATCTCTATC 
GCCTCTCTAG 
TTCCAGCATC 
GTTCCAGGGA 
CTTCACTGCG 
G6GAACGCGC 
6CAACCATCA 
6ACX3CAGATG 
AAGCCTGA6A 
CTAAACTTGC 
ATCTACCAGT 
AGGTACTTCG 
CTGGTTGGGG 
TCCTGGACGA 
TCTGGCGGGC 
GCGACCAACA 
CAGTTCACTG 
TGTGAAGCCA 
TCCGTGGACC 
CTCCACGACC 
GTGGCCCACC 
ACAACAGTGG 
CCAGCCATCA 
AGCCCTGAAG 
TGTGTG6CCC 
CCTGACGTCA 
ACTGTTGACA 
CCAAAT6ATT 
C66GCG6GAG 
TTGATGGTCG 
CTGAACCTCA 
GACATGTGCT 
CCCATGTGGG 
GGCTTCAACA 
ATGCCGCGCC 
TTCACCCACA 



41 
I 

CX3CCGTCGGC 
TGCTGTTCTG 
CGAGCCGCTG 
TGCCCGCCGT 
TCCAACCTGG 
AGATCAAGAG 
TGTACAAGAA 
AGCAACTATA 
TCCCGAAGCT 
CATTTAATCA 
ACTGTGAAAT 
AGGCAGCGGC 
CCCCGGAAGA 
TGACCTCGGG 
TCATCTGGCT 
TGGACGATGG 
GCAT6GCAAA 
GGTCTCCAGC 
A6AGC6TCAC 
GAGGTGACCG 
TTTACATACA 
ACATTGACAG 
TGACGCCTCA 
AGGGCAACCC 
GGCGGCACCT 
AGGGCCAGTA 
TGACTGTGCA 
AGGTGGGCGC 
CCTGGAACAA 
GATTCTTGAC 
GGAACACCAT 
GTCX3AAATGG 
GAGCTATAAA 
TGCTGGCCTT 
AAATCTTTGA 
ACCTCAACGG 
TCGCAAACCT 
TCCACCAGAA 
GCGCCTCGCT 
CCCCTCGGGG 
TGGTGTCCAC 
TGCTGATGCA 



51 
I 

CATGGCCAAG 
CGCCTGGGGG 
CCTGTGCTTC 
GGCGCCGCA6 
GGCATTCAGG 
GATACCTAGT 
TGAGATCCAG 
CCTGCACTTT 
CGAGAGGCTA 
CTTGGAATCT 
CCTGTGGTTG 
CATCTGTGAA 
GCTGAACTGT 
GAACACCGTG 
GCGAAACAAT 
GACCCTGATG 
GAACGT6GCC 
TCGACCCACT 
GCTGGAGTGC 
CACACCCTTG 
GAACGTCGTA 
CGTCCATGCC 
GGACAGAGTC 
GCCGCCCGTC 
GGTCCTGTCA 
CGAATGCCAG 
GCCCAGAGTC 
CAATGTGCAG 
GGATGGGGTT 
CATCAATGAC 
TGGGTCGGCC 
AGATCCGTTT 
CTCAACCCGA 
GTTCCGGTAT 
ACGGACATTG 
AACAA6TTAC 
GTCGGGCTGT 
6TACCGGACG 
GACCGCCTTC 
CATCAACCCC 
CACCCTQATC 
GTGG66CCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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TTCCTGGACC 
GGACAGCACT 
CCCAATGACT 
GTGTGCGGCA 
AACCAGCTCA 
CGCAGCATCC 
TCCGGGAAGC 
AACGAGAGCC 
CTGACCAGCA 
AAGCTGAACC 
GCGGAGATCC 
ATGAGGACGC 
GCCTTCGCCA 
CTGGACGAGA 
TCTCCCTTCC 
GTGGC66GGA 
TTCTCCATGG 
GACCAC6GGA 
ACGTTCGAGG 
TTGTATGGCT 
CCTGGCAGCC 
CGAGATGGGG 
CAGATCAAGC 
GTGCAGAGCG 
ATCCCCAGGG 
CAGTTCAATG 
GAGGACAAGC 
GAACATCTCA 
GACTTCAGAG 
AAGAAACTTG 
GCCAACAACA 
GTCACCTGCT 
GGGGCCTGCT 
AGGCTCCTCA 
TGCG6ACTGC 
GTGCTGTTAC 
ACAGCAGGTG 
TTTTATTTAA 
ATTTAGGCGC 
ACCTCTATAT 
AGGTGGGGTG 
CTATGTTTAA 
CACAGGGACA 
AACTCCTCCT 
GACAAACATT 
CTTCCACACC 
TCACTTGCAC 
TGTGTATCTG 
CCCTGGTTGC 
AGGAGCTCAA 



ACGACCTCGA 
GCAGCAACGT 
CCCGGGCCAG 
GCGGCATGAC 
CCTCCTACAT 
GCGACCTGGC 
CGCTGCTCCC 
CCATCCCCTG 
TGCACACGCT 
CGCACTGGGA 
AGCACATCAC 
TGGGAGAGTA 
CCGCGGCCTT 
ACTTCCAGCC 
GGATTGTGAA 
AAATGCGTGT 
CACACACGGT 
TCCCACCCTA 
ACCTGAAAAA 
CGACACTCAA 
GGCTGGGCCC 
ACAGGTTGTG 
AGACGTCGCT 
ACGTGTTCAG 
TGGACCTCCG 
CCTTTTCCTA 
CGACCAAGAA 
GCAACAGCAC 
AGTTTGTTCT 
AATCACGGCT 
CCAAGTGGAA 
TCGTGGAAGC 
GTCCAGTCTG 
GAGTTTGTCT 
A6ACCAGGAA 
AGAAGGCAGT 
CCTGAAGGGA 
TTCTTTTAAA 
CTAAATTGGT 
GTCAGCCTTG 
AGTCTCGGAG 
AAAGAAAATT 
CTGTCTGGGG 
TCCTCTGGGC 
CCCGCTGCTC 
TGATTAGAAC 
ACATACTGCC 
ATACCTGCCG 
GTCCACGTCC 
GT6TCGGGAA 



CTCCACGGTG 
GTGCAGCAAC 
GAGCGGGGCC 
TTCGCTGCTC 
CGACGCATCC 
CAGCCACCGC 
CTTCGCCACC 
CTTCCTGGCC 
GTGGTTCCGC 
CGGCGACACC 
CTACCAGCAC 
CCACGGCTAC 
CAGGTTTGGC 
CATTGCACAA 
T6AGGGC66C 
GCCCTCGCAG 
GGCTCTGGAC 
CCACGACTAC 
TGAGATTTU^ 
CATCGACCTG 
CACCCTGATG 
GTATGAGAAC 
GGCCAGGATC 
GGTGGCGGAG 
GGTGTGGCAG 
TCATTTCCGA 
AACAAGACCA 
CTCAGCCTTC 
GGAAATGCAG 
CAGTACCACA 
AAAAGATGCA 
TTGCCCCCCT 
CTTACAGAAG 
GCTGTGCCAT 
ACACCCAGAA 
GCAGGAGGCT 
AGCAGGCAGG 
AT6AAAAATT 
TTTGCCTCCC 
CCTTGTTCAG 
CTGCCAGAGG 
GGTGTTTGGC 
GTGCAGTGCA 
TCTCTGTAAC 
GAAGCAGCTG 
ATTCATAAGC 
TAGTTGTGAA 
AGGGCCAAGG 
TGAACAAGAG 
CT6TCTAACT 



GTGGCCCTGA 
GACCCCCCCT 
CGCTGCATGT 
ATGAACTCCG 
AACGTGTACG 
GGCCTGCTGC 
GGGCCGCCCA 
GGGGACCACC 
GAGCACAACC 
ATCTACTATG 
TGGCTCCCGA 
GACCCCGGCA 
CACACGCTTG 
GATCACCTCC 
ATCGATCCGC 
CTGCTGAACA 
CTG6CGGCCA 
AGGGTCTACT 
AACCCTGAGA 
TTTCCGGCGC 
TGTCTTCTCA 
CCTGGGGTGT 
CTATGCGACA 
TTCCCTCACG 
GACTGCTGTG 
GGCAGACGGT 
CGGAAAATAC 
AGCACACGCT 
AAGACCATCA 
GAGTGCGTG6 
TGCACCATTT 
6CCACCTGTG 
AGGGCGGAGG 
CGTGAGATCG 
CTCGTGACAT 
TCCAACCAGA 
AGTCCTAGCT 
GGTGCTACTA 
AACACCATTT 
ATGCCAGGAG 
GGCTCACCGA 
AAACGGAACA 
AGCCCCCGGC 
ATTTCACCAC 
TATAGCCTGT 
CACATTTAGA 
CCAAATGTGA 
GTGTGTGTTG 
CCGCTTCCGG 
TCAGGTTGTG 



GCCAGGCACG 
GCTTCTCTGT 
TCTTCGTGCG 
TGTACCCGCG 
GGAGCACGGA 
GGCAGGGCAT 
CGGAGTGCAT 
6CGCCAACGA 
GCATTGCCAC 
AGACCAGGAA 
AGATCCTGGG 
TCAATGCTGG 
TCAACCCACT 
CCCTTCACAA 
TTCTCAGGGG 
CGGAGCTCAC 
TCAACATCCA 
GCAATCTATC 
TCCGGGAGAA 
TCGTGGTGGA 
GCACACAGTT 
TCTCCCCGGC 
ACGCGGACAA 
GCTACGGCAG 
AAGACTGTAG 
CTCTTGAGTT 
CCAGTGTTGG 
CAGATGCATC 
CAGACCTCAG 
ATGCCGGGGG 
GTGAATGCAA 
CTGTCCCCGT 
AAAAGCCC^ 
GGTGGCCGAT 
TTCATGACAA 
GCATCTGCGG 
TCACGTTAGA 
TTAAATTGCA 
CTTTTTAAAT 
CCGGCAGACC 
AATCGGGGTT 
GAACCTTTGA 
CTCTTCCCTG 
ACGTCAGCAT 
GACTCTCCGT 
AACAGATTTG 
AAAAACCTCC 
ACAACGCCGC 
ATGGCTCTTC 
TGAGTGCGTT 



CTTCTCCGAC 
CATGATCCCC 
CTCCAGCCCT 
GGAGCA6ATC 
GCATGAGGCC 
CGTGCAGCGG 
GCGGGACGAG 
GCAGCTGGGC 
GGAGCTGCTC 
GATCG'^GGGT 
GGAGGTGGGC 
CATCTTCAAC 
GCTTTACCGG 
AGCTTTCTTC 
GCTGTTCGGG 
GGAGCGGCTG 
GCGGGGCCGG 
GGCGGCACAC 
ACTGAAAAGG 
GGACCTGGTG 
CAAGCGCCTG 
CCAGCTGACT 
CATCACCCGG 
CTGTGACGAG 
6ACCAGGGGG 
CAGCTACCAG 
GAGACAGGGG 
TGGGACAAAT 
AACACAGATA 
CGAATCTCAC 
AGACGGGCAG 
GAACATCCCA 
^CTCCTGGG 
GGCAGGGAGC 
CGTCCAGCTG 
AGAAGGAGGC 
CTTCTCAGGT 
CAGTTGAATC 
AAAGCAGGAT 
TGTCACCCGC 
CCATCACAAG 
TGAGAGCGTT 
GGAACCTCTG 
CTAATCCCAA 
GTGTCAGCTC 
CTTTCAGCTG 
TTCATCCCAT 
TCCCAGCCGG 
CCAAGGGAGG 



2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3640 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 



Seq ID NO: 87 Protein sequence; 



55 I 



Protein Accession 
1 11 



SRPWWLRASE RPSAPSAMAK 
RTTVRCMHUi IiEAVPAVAPQ 
GAFEDLENIiK YLYLYKNEIQ 
FIjHNNRITHI* VPGTFNHLES 

60 yprriqgrsv atitpeelnc 
neijSMKtdsr uhjIiDdgtlm 

FVIQPQNTEV LVGESVTIiEC 
QGDSGBYACS ATNNIDSVHA 
lAWTKGGSQIi SVDRRHIiVLS 

65 TPVFASIPSD TTVEVGANVQ 
VGPADAGRYE CVARNTIGSA 
THIiFDSRPRS PNDLI*AI*FRY 
HYNDLVSPQY LNLIANLSGC 
ERLIiKSVYEN GFNTPRGINP 

70 FLDHDLDSTV VALSQARFSD 
VCGSGMTSLL MNSVYPREQI 
SGKPIiLPFAT GPPTEC34RDE 
KIiNPHWDGDT lYYETRKIVG 
AFATAAFRFG HTLVNPIiIjYR 

75 VAGKMRVPSQ LLNTELTERL 
TFEDIjKNEIK NPEIREKLKR 



BAA13219 
21 
I 

RSRGPGRRCL 
TSILDIiRFJIR 
SXDRQAFKGIi 
MKRIjRIiDSNT 
ERPRITSEPQ 
IQNTQETDQG 
SATGHPPPRI 
TAFIIVQAIjP 
SGTIiRISGVA 
LPCSSQGEPE 
SVSMVLSVNV 
PRDPYTVEQA 
TAHRRVNNCS 
HRLYNGHALP 
GQHCSNVCSN 
NQL.TSYIDAS 
NESPIPCFIiA 
AEIQHITYQH 
LDENFQPIAQ 
FSHAHTVAIiD 
LYGSTIiMIDIi 



31 
I 

LALVLFCAWG 
IREIQPGAFR 
ASLEQLYIiHF 
LHCDCEIIiWIi 
DADVTSGNTV 
lYQCMAKNVA 
SWTRGDRTPL 
QFTVTPQDRV 
LHDQGQYECQ 
PAITWNKDGV 
PDVSHNGDPF 
RAGEIFERTL 
DMCFHQKYRT 
MPRLVSTTLI 
DPPCFSVMIP 
19VY6STEHEA 
GDHRANEQLG 
WLPKXLGBVG 
DHIiPLHKAFF 
IiAAlNIQRGR 
FPALWBDIiV 



41 

I 

TIiAWAQKPG 
RLRNIiNTLIiL 
NQIBTLDPDS 
ADLLKTYAES 
YFTCRAEGNP 
GEVKTQEVTL 
PVDPRVNITP 
VIEGQTVDFQ 
AVNIIGSQKV 
QVTESGKPHI 
VATSIVEAIA 
QIiIQEHVQHG 
HDGTCNNLQH 
GTETVTPDEQ 
PNDSRARSGA 
RSIRDIASHR 
LTSMHTLWFR 
MRTIiGEYHGY 
SPFRIVNEGG 
DHGIPPYHDY 
PGSRUGPTLM 



51 
I 

AGCPSRCLCF 
NNNQIKRIPS 
FQHLPKIjERIj 
GNAQAAAICE 
KPEIIWLRNN 
RYFGSPARPT 
SGGLYIQNW 
CBAKGNPPPV 
VT^TVQPRV 
SPEGFIiTIMD 
TVDRAINSTR 
LMVDliNGTSY 
PMWGASLTAF 
FTHMIMQWGQ 
RCMFFVRSSP 
GLLRQGIVQR 
EHNRIATBIiL 
DPGINAGIFN 
IDPIiLRGLFG 
RVYCNIiSAAH 
CLIiSTQFKRIi 



60 
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180 
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RDGDRLWYEN PGVFSPAQLT QIKQTSLARI LCDNADNITR VQSDVPRVAE FPHGYGSCDE 1320 
IPRVDIiRVWQ DCCEDCRTRG QFNAFSYHFR GRRSLEFSYQ EDKPTKKTRP RKIPSVGRQG 1380 
EHLSNSTSAF STRSDASGTN DFREFVLEMQ KTITDLRTQI KKLESRLSTT ECVDAGGESH 1440 
ANNTKWKKDA CTICECKDGQ VTCFVEACPP ATCAVPVNIP GACCPVCLQK RAEEKP 
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Seq ID NO: B8 DMA aecmence 

Nucleic Acid Accession #: NM_004834.l 

coding sequence: 80-3577 (underlined sequences correspond to start and stop codons) 
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AATTCGAGGA 
TTGGTGGCAA 
CCTCTCCTCC 
CACCTATGGA 
AGTTATGGAT 
GAAATACTCT 
TCCAGGACAT 
AGACCTTGTG 
CAGAGAAATC 
CAAGGGCCAG 
GAGTGCTCAG 
GATGGCTCCT 
TGATCTTTGG 
TGACATGCAT 
GTCAATWU^AA 
CATGCAGCGG 
TGAAAGGCAA 
CGAGAAAGAT 
TGAACAGGAA 
AGATTTCCTG 
GTTACTACAG 
GAGACAGAAG 
GAGAGA6CGG 
AAAGAGGCGT 
AGAAGAAGAA 
GGAGCAGCGG 
GCATGACCAT 
AAGGAGCAAG 
AGCGCGAGAG 
CCCACTGCAG 
TGAGCCCAGG 
CTCCTCAGGG 
CTCCGGGGAA 
CCTGGAAAAT 
TGCTGGCGAA 
GCCACCTCAC 
GGAGGACGAC 
CAGAGCAGCG 
TGTCCATGAT 
CGTCCGCCAG 
TACACCTTTT 
ATCTGTGGTG 
CCGGAAAGGC 
GGAGATTCGT 
AGTGAATTTG 
GAAGGTCTAT 
GAATGTCTTG 
GTTAAGAAAT 
CGTAGGGGAT 
TCTGGTGATT 
CAAATTTATG 
CACTGTTGAG 
TGTTGATGTG 
CCCACACTCT 
AGATGGAATG 
AAGGATCACC 
TCGATCCAAT 
TGGTCACTTG 
ACGCAATGAC 
CATGACCTTA 
TGGCCTCCAG 
CTGCACCGAG 
CTCCTTCCTG 



21 
I 



TCCX3GGTACC ATGGCACAGA 
AAAGGGAA AA TGG CGAACGA 
CTGC6GGATC CTGCTGGGAT 
CAA6TCTATA AGG6TCGACA 
GTCACTGAGG ATGAAGAGGA 
CATCACAGAA ACATTGCAAC 
GATGACCAAC TCTGGCTTGT 
AAGAACACCA AAGGGAACAC 
CTGAGGGGAC TGGCACATCT 
AATGTGTTGC TGACTGAGAA 
CTGGACAGGA CTGTGGGGCG 
GAGGTCATCG CCTGTGATGA 
TCTTGTGGCA TTACAGCCAT 
CCAATGAGAG CACTGTTTCT 
TGGTCGAAGA AGTTTTTTAG 
CCCTCTACAG AGCAGCTTTT 
GTTAGAATCC AGCTTAAGGA 
GAAACTGAGT ATGAGTACAG 
66A6AGCCAA GTTCCATTGT 
AGACTGCAGC AG6AGAACAA 
GAGCAACAGC TCCGGGAGCA 
CGGATT6AGC AGCAGAAAGA 
GAGGCTAGAA GGCAGCAGGA 
CTAGAGGAGT TGGAGAGAAG 
AAGAGGAGAG TTGAAAGAGA 
CACTTGGAAG TCCTTCAGCA 
AGGAGGCCGC ACCCGCAGCA 
CCAAGCTTCC ATGCTCCCGA 
GTTCCTGTGA GAACAACATC 
GGCAGTGGGC AGCAGAATAG 
CTTCTGTGGG AGAGAGTGGA 
TCCAGCAACT CAGGATCCCA 
CGCTTCAGAG TGAGATCATC 
GCAGTGAAAA AACCTGAAGA 
GTGGATCTGA CCGCACTGGC 
AAAGTAACGG ACTACTCCTC 
GATGTGGAGC AGGAA6GGGC 
TCATCTCTGA ATTTGAGCAA 
GATGTAGAAA GTGAGCCGGC 
ACTCAGTCCG CTAGTAGCAC 
ATAGACCCCA GATTACTACA 
GGATTTTCCT GTGATGGGAT 
TCAGTGGTCA ATGTGAATCC 
AAATACAA6A AGAGGTTTAA 
CTAGTGGGTA CAGAGAGTGG 
CCTCTTATCA ACCGAAGACG 
GTGACAATAT CTGGCAAAAA 
AAAATACTTC ACAATGATCC 
TTGGAAGGAT GTGTACATTA 
GCTTTGAAGA GTTCTGTGGA 
GCCTTTAAGT CATTTGGAGA 
GAAGGCCAGA GGTTGAAAGT 
GATTCAGGAT CAGTCTATGA 
ATGATCCAGT GTAGCATCAA 
GAGCTTCTGG TGTGCTATGA 
AAGGAT6TAG TTCTACAGT6 
CA6ACAATGG GCTGGGGAGA 
GATGGTGTGT TCATGCACAA 
AAGGTGTTCT TTGCCTCTGT 
GGCAGGACTT CTCTTCTGAG 
AGTCTTCAA6 ATCCTGAGAA 
GGCAACCAGG ACAGCTGTGT 
TTCCTCTTAT ATACCAGTTT 



31 
I 

GCGACAGAGA 
CTCCCCTGCA 
TTTTGAGCTG 
TGTTAAAACG 
AGAAATCAAA 
ATATTATGGT 
TATGGAGTTC 
ACTCAAAGAA 
TCACATTCAT 
TGCAGAGGTG 
GAGAAATACG 
GAACCCAGAT 
TGAGATGGCA 
CATTCCCAGA 
TTTTATAGAA 
GAAACATCCT 
TCATATAGAT 
TGGGAGTGAG 
GAACGTGCCT 
GGAACGTTCC 
GGJU^GAATAT 
ACAGAGGCGA 
AGGT6AACAG 
GCGCAAAGAA 
ACAGGAGTAT 
GCAGCTGCTC 
CTCGCAGCAG 
GCCCAAAGCC 
TCGCTCCCCT 
CCAGGCAGGA 
GAAGCTGGTG 
GCCCGGGTCT 
ATCCAAGTCT 
TAAAAAGGAA 
CAAAGAGCTT 
ATCCAGTGAG 
TGACGAGTCC 
TGGTGAAACG 
CATGACCCCA 
ACTCCAGAAA 
GATTTCTCCA 
GA6ACCAGAA 
TACCAACACT 
CTCTGAGATT 
CCTGATGCTG 
ATTTCAACAA 
GGATAAGTTA 
AGAAGTTGAG 
TAAAGTTGTA 
AGTCTATGCG 
ATTGGTACAT 
GATCTATGGA 
CATTTATCTA 
ACCCCATGCA 
AGATGAGGGG 
GGGAGAGATG 
GAAGGCCATA 
AAGGGCTCAA 
TCGGTCTGGT 
CT6GTAGAAG 
CTTGGAATTC 
GTGCAGACCT 
ATCCCCATTC 



41 



51 



CATTTATTGT 
AAAAGTCTGG 
GTGGAAGTGG 
GGTCAGTTGG 
CTGGAGATAA 
GCTTTCATCA 
TGTGGGGCTG 
GACTGGATCG 
CATGTQATTC 
AAACTTGTTG 
TTCATAGGCA 
GCCACCTATG 
GAAGGTGCTC 
AACCCTCCTC 
GGGTGCCTGG 
TTTATAAGGG 
CGTACCAGGA 
GAAGAAGAGG 
GGTGAGTCTA 
GAGGCTCTTC 
AAAAGGCAAC 
CGGCTAGAAG 
CGAAGGAGAG 
GAAGAGGAGA 
ATCA6GCGAC 
CAGGAGCAGG 
CCGCCACCAC 
CACTACGAGC 
GTTCTGTCCC 
CAGAGAAACT 
CCCAGACCTG 
CACCCTGGGT 
GAAGGCTCTC 
GTTTTCAGAC 
CGAGCAGTGG 
GAGTCGGGGA 
ACCTCAGGAC 
GAATCTGTGA 
TCCAAGGAGG 
CACAAATCTT 
TCTA6CGGAA 
GCCATAAGGC 
AGGCCACAGA 
CTGTGTGCTG 
CTGGACAGAA 
ATGGACGTAC 
CGTGTCTACT 
AAGAAGCAGG 
AAATATGAAA 
TGGGCACCAA 
AAGCCATTAC 
TCCTGTGCTG 
CCAACACATG 
ATCATCATCC 
GTTTATGTAA 
CCTACATCAG 
GAGATCCGAT 
AGACTAAAAT 
GGCAGCAGTC 
CAGTGTGATC 
CTTGTAACTG 
CATGTGTT6G 
TTTTTTTTTT 



TATTTGTTTT 
TGGACATCGA 
TTGGAAATGG 
CA6CCATCAA 
ATATGCTAAA 
AAAAGAGCCC 
GGTCCATTAC 
CTTACATCTC 
ACCGGGATAT 
ACTTTGGTGT 
CTCCCTACTG 
ATTACAGAAG 
CCCCTCTCTG 
CCCGGCTGAA 
TGAAGAATTA 
ATCAGCCAAA 
AGAAGAGAGG 
AGGAAGTGCC 
CTCTTCGCCG 
GGAGACAACA 
TGCTGGCAGA 
AGCAACAAAG 
AACAAGAAGA 
G6AGACGGGC 
AGCTAGAAGA 
CCATGTTACT 
GGCAGCAGGA 
CTGCTGACCG 
6TCGA6ATTC 
CCACCAGTAT 
GCAGTGGCAG 
CTCAGAGTGG 
CATCTCAGCG 
CCCTCAAGCC 
AAGATGTACG 
CGACGGATGA 
CAGAGGACAC 
AAACCATGAT 
GCACTCTAAT 
CCTCCTCCTT 
CAACAGTGAC 
AAGATCCTAC 
GTGACACCCC 
CCTTATGGGG 
GTGGCCAAGG 
TTGAGGGCTT 
ATTTGTCCTG 
GATGGACMC 
GAATCAAATT 
AGCCATATCA 
TGGTGGATCT 
GATTCCATGC 
TAAGAAAGAA 
TCCCCAATAC 
ACACATATGG 
TAGCATATAT 
CTGTGGAAAC 
TCTTGTGTGA 
AGGTTTATTT 
CAGGGATTAC 
6AGCTCGGAG 
GTTCTCTCCC 
TCTTACTCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
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AAATAAATCA AGGCTGCAAT GCAGCTGGTG CTGTTCAGAT TCCAAAAAAA AAAAAAAACC 3840 
ATGGTACCCG GATCCTC6AA TTCC 

Seq ID No: 89 Protein aecnience! 
protein Accession #: NP_00482S.l 

1 11 21 31 41 51 

I I I I I I 

MANDSPAKSIi VDIDIiSSIiRD PAGIFELVEV VGNGTYGQVY KGRHVKTGQL AAIKVMDVTE 60 

DEEEEIKLEI NMLKKYSHHR NIATYYGAFI KKSPPGHDDQ LWLVMEFCGA GSITDLVKNT 120 

KGNTLKEDWI AYISREILRG lAHLHIHHVI HRDIKGQNVL LTENAEVKLV DFGVSAQLDR 180 

TVGRRNTFIG TPYWMAPEVI ACDENPDATY DYRSDLWSCG ITAIEMAEGA PPL.CDMHPMR 240 

ALFLIPRNPP PRLKSKKWSK KFFSFIEGCL VKNYMQRPST EQLLKHPFIR DQPNERQVRI 300 

QLKDHIDRTR KKRGEKDETE YEYSGSEEEE EEVPEQEGEP SSIVNVPGES TliRRDPIiRLQ 360 

QENKERSEAL RRQQIiLQEQQ LREQEEYKRQ LLAERQKRIE QQKEQRRRLE EQQRREREAR 420 

RQQEREQRRR EQEEKRRIiEE LERRRKEEEE RRRAEEEKRR VEREQEYIRR QIiEEEgRHIiE 480 

VLQQQLLQEQ AMLLHDHRRP HPQHSQQPPP PQQERSKPSP HAPEPKAHYE PADRAREVPV 54 0 

RTTSRSPVI.S RRDSPLQGSG QQNSQAGQRN STSIEPRLI-W ERVEKLVPRP GSGSSSGSSN 600 

SGSQPGSHPG SQSGSGERFR VRSSSKSBGS PSQRbBNAVK KPBDKKEVFR PtiKPAGEVDL 660 

TALAKBLRAV EDVRPPHKVT DYSSSSEESG TTDEEDDDVE QEGADESTSG PEDTRAASSL 72 0 

NLSNGETESV KTMIVHDDVE SEPAMTPSKE GTIilVRCyTQS ASSTLQKHKS SSSFTPFIDP 7 80 

RLLQISPSSG TTVTSWGFS CDGMRPBAIR QDPTRKGSW NVNPTNTRPQ SDTPEIRKYK 84 0 

KRFNSBIIiCA ALWGVNIjLVG TESGLMLLDR SGQGKVYPLI NRRRFQQMDV LEGIiNVLVTI 900 

SGKKDKLRVY YIiSWLRNKIL HNDPEVEKKQ GWTTVGDLEG CVHYKWKYE RIKFLVIALK 960 

SSVEVYAWAP KPYHKPMAFK SFGELVHKPIi LVDLTVEEGQ RLKVIYGSCA GFHAVDVDSG 1020 

SVYDIYLPTH VRKNPHSMIQ CSIKPHAIII LPNTDGMEIiL VCYEDEGVYV NTYGRITKDV 1080 

VLQWGEMPTS VAYIRSNQTM GWGEKAIEIR SVETGHIjDGV PMHKRAQRLK FLCERNDKVF 1140 
FASVRSGGSS QVYFMTIX3RT SIiLSW 

Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession # : none found 

coding sequence: 2-71 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I i I I I 

TTACA CTTCA ATTCCTTACA CGGTATTTCA AACAAACAGT TTTGCTGAGA GGAGCTTTTG 60 

TCTCTCC TTA AG AAAATGTT TATAAAGCTG AAAGGAAATC AAACAGTAAT CTTAAAAATG 120 

AAAACAAAAC AACCCAACAA CCTAGATAAC TACAGTGATC AGG6AGCACA GTTCAACTCC 180 

TTGTTATGTT TTAGTCATAT GGCCTACTCA AACAGCTAAA TAACAAC31CC AGTGGCAGAT 240 

AAAAATCACC ATTTATCTTT CAGCTATTAA TCTTTTGAAT GAATAAACTG TGACAAACAA 300 

ATTAACATTT TTGAACATGA AAGGCAACTT CTGCACAATC CTGTATCCAA GCAAACTTTA 360 

AATTATCCAC TTAATTATTA CTTAATCTTA AAAAAAATTA GAACCCAGAA CTTTTCAATG 420 

AAGCATTTGA AAGTTGAAGT GGAATTTAGG AAAGCCATAA AAATATAAAT ACTGTTATCA 4 80 

CAGCACCAGC AAGCCATAAT CTTTATACCT ATCAGTTCTA TTTCTATTAA CAGTAAAAAC 540 

ATTAAGCAAG ATATAAGACT ACCTGCCCAA GAATTCAGTC TTTTTTCATT TTT GTTTTT C 600 

TCAGTTCTGA GGATGTTAAT CGTCAAATTT TCTTTGGACT GCATTCCTCA CTACTTTTTG 660 

CACAATGGTC TCACGTTCTC ACATTTGTTC TCGCGAATAA ATTGATAAAA GGTGTTAAGT 720 

TCTGTGAATG TCTTTTTAAT TATGGGCATA ATTGTGCTTG ACTGGATAAA AACTTAAGTC 780 

CACCCTTATG TTTATAATAA TTTCTTGAGA ACAGCAAACT GCATTTACCA TCGTAAAACA 840 

ACATCTGACT TACGGGAGCT GCAGGGAAGT GGTGAGACAG TTCGAAC6GC TCCTCAGAAA 900 

TCCAGTGACC CAATTCTAAA GACCATAGCA CCTGCAAGTG ACACA ACAAG CAGATTTATT 960 

ATACATTTAT TAGCCTTAGC AGGCAATAAA CCAAGAATCA CTTTGAAGAC ACAGCAAAAA 1020 

GTGATACACT CCGCAGATCT GAAATAGATG TGTTCTCAGA CAACAAAGTC CCTTCAGAAT 1080 
CTTCATGTTG CATAAATGTT ATGAATATTA ATAAAAAGTT GATTGAGA 

Seq ID No: 91 Protein sequencer 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

YTSIPYTVFQ TNSFAERSFC LSIi 

Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession #: NM_003706.1 

Coding sequence: 310-1935 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CACGAGGCAG GGGCCATTTT ACCTCCAGGT TGGCCCTGCT CAGGACCAGG AGGAAACACC 60 
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10 



15 



20 



25 



30 



35 



40 



TCCAGCCCGC 
GGCACCGAGG 
GACTACACCG 
CACCGCGCAT 
CAGTGCACCA 
GCGGCCGTGG 
GCTGATGAGG 
GCCTGCCTTG 
CTCGCAGG6G 
ATGGAAGCTC 
AAGAGCCTAC 
TGGGCCTACA 
ATGAAGAAGC 
GACCTGCAAC 
CACCACGCTG 
TTCAAGAAGG 
TTATGGGGAA 
AGGAATCTGA 
CACCTTATTT 
CCAGAAGATG 
ACCAGGACCT 
CTCAGTAACT 
GGGACCACTC 
CGGAAGCACC 
CTGCCCCCGA 
TTCGAGACCA 
GTAGAAGAG6 
GGAGAAACTG 
GATATTGAGG 
GATGTGGTGG 
ATCCTTAGAG 
TGCTGCTTGG 
GGCCCTGAAG 
CTTGAGTTCA 
TTTGATAGCG 
CAGGTCCCTC 
TCCAATGTCC 
TTTATGACTG 
TGGGCTTAAA 
GCTCAATAAA 
AAAAAAAAAA 



GACCTCCTCC 
ATCCCCGCAG 
CTAAGCGCAG 
GCGCACACGC 
TGG GAAGCTC 
AGAGACGAAG 
CCCCAGTTGT 
GGGTCCTGAG 
TCTCTGGATC 
TCGAGGCTGA 
AGAAAACCAT 
TGGTTATCTC 
CCGTGGAAGA 
CTTCCTGGCA 
GCTTCTCTGC 
GAAGACTGGT 
GTGCTCTTGG 
CCCTGAAAGG 
TTGCCCGATT 
AAGGCGGTGA 
CCCTGGAAAA 
TGATGGATTT 
ACAACTTCCT 
TCCACCTGGT 
CGCGGGAGGT 
TCCGGGCTAC 
CTGAGCTGGA 
GACCAGTG6T 
CATGGAGTGA 
TGCTACTCTT 
AGTTGATGAA 
C ATAGA TGAG 
TCCACCTGGC 
CCTTGGCTGT 
TCATTCAGAA 
GGGAGATCCT 
AGATGGCCAG 
CACTTCTAGC 
TCATGGGCTA 
TGCTTGCTGA 
AAAAAAAAAA 



CACAGGGGGA 
TCTTCACCCG 
GGAGCCCAAG 
CCCAGACCCA 
TGAAGTTTCC 
ACTTCATGTG 
TGCTGTGCTG 
TGAGATGAAA 
CACTTGGGCA 
CCTGAAACAT 
CCAAGCAGCG 
TAAGCAAACC 
AGGGACACTA 
GGAGGCAAGA 
ACTGGGGGCC 
CAGAACTCAC 
TAACACTGAA 
TTTATGGAGA 
ACTGAGGCTG 
GCCTGAACAC 
GCAGGAGCAG 
TGTGAAGAAA 
GTACAAACAC 
GGATGCTGGT 
TCACCTCATC 
CACTGACTAC 
TTTGTGGTCC 
GATACATTTT 
CACATACGAC 
GGCATTAGCC 
CGTGGCCGGG 
CCTCAGCTTC 
CTTCCTGTTC 
CCTAACAGGG 
GAAGGTGTCC 
GGAGCTGGAG 
AATGAATGTG 
CAGTAGCTCT 
TCTCTCCACA 
TTGACTGATG 
AAAAAAAAAA 



AAAGGAAAGC 
CGGAGATTCC 
CCTCCGCACC 
GGCTCAGGAG 
ATT^TTCCTG 
CTGAAAGCTC 
GGCTCAGGCG 
GAACAGGGCC 
ATATCTTCTC 
CGATTTACCC 
AGGTCTGAGA 
A6AGAACTGC 
CCCTACCCAA 
GCACCAGAGA 
TTT6TTTCCA 
CCTGAGAGAG 
GTCATTAG6G 
AGGGCTGTTG 
CAAGAAAGTT 
ACCTGGCTGA 
CCCCATGAGG 
ACAGGCATTT 
GGTGGCATCC 
TTAGCCATCA 
CTCTCCTTCG 
TGCCGCCGCC 
AAGGCCCCCG 
CCCCTGTTCA 
ACATTCAAGC 
AAGAAGAATG 
CTCTACTACC 
CAGGGCACT6 
TTCACTCCCT 
CCAATCACCA 
AAGGAGCTGA 
CATGAGTGTC 
ATAGTTCAGA 
GCACAAGTTA 
GCCAAGTGGA 
AAAAAAAAAA 
AAAA 



AGGAGGACCA 
GGCTGAAGGA 
GGATTCCGGA 
GACTGAGAAT 
GGCTCCAGAA 
TGAAGAAGCT 
GAGGACTGCG 
TGTTGGATGC 
TCTACACCAA 
GACAGGAGTG 
ATTACTCTCT 
CGGAGTCTCA 
TATTTGCAGC 
CCTGGTTCGA 
TAACCCACTT 
ACCTGACTTT 
AATACATTTT 
CTAATGCTAA 
CACAAGGGGA 
CTGAGATGCT 
ACCCCGAAAG 
GCGCTTCAAA 
GGGACAAGAT 
ACACTCCCTT 
ACTTCAGTGC 
ACAAGATCCC 
CCAGCTGCTA 
ACATAGATGC 
TTGCTGACAC 
TCAGGGAAAA 
CGAAGGATAG 
TGGGCCTGTT 
TCAGCCACAC 
GTGACCAGCT 
AGGTG6TGAA 
TGACAATCAG 
CCJAATGCCTT 
GCTCTGTAGA 
GCTCT6AGAA 
AAAAAAAAAA 



CAGAAGCTTT 
GCTGTCCAGC 
GCACAAGCTC 
TTTCTGACCG 
AGAAGAAAAG 
AAGGATTGAG 
GGCTCACATT 
CGTCACGTAC 
TGATGGTGAC 
GGACTTGGCT 
GACCGACTTC 
TTTGTCCAAT 
CATTGACAAT 
GTTCACCCCT 
CGGAAGCAAA 
CCTGAGAGGT 
TGACCAGTTA 
AAGCATTGGA 
ACATCCTCCC 
CGAGAATTGG 
GAAAGGCTCA 
GTGGGAATGG 
AATGAGCAGC 
CCCACTCGTG 
GGGAGATCCT 
CTTTCCCCAA 
CATCCTGAAA 
CTGTGGAGGT 
CTACACTCTA 
CAAGAAGAAG 
TGCCCGAAGT 
GGTCTACTAG 
GCTTCATGGC 
AGACTGTGAT 
ATTTGTCCTG 
AAGCATCATG 
CCACTGCTCC 
AGTAAGAACT 
TACAACAAGT 
AAAAAAAAAA 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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Seq ID No: 93 Protein seoruencg* : 
Protein Accession #: np 003697.1 



60 



65 



1 11 21 31 41 51 

<n ^ I I I I I 

:>U MGSSEVSIIP GLQKEEKAAV ERRRLHVLKA LKKLRIEADE APWAVLGSG GGLRAHIACL 6 0 

GVLSEMKEOG LLDAVTYLAG VSGSTWAISS LYTNDGDMEA LEADLKHRPT RQEWDLAKSL 120 

QKTIQAARSE NYSLTDFWAY MVISKQTREI. PESHLSNMiaC PVEEQTLPYP IFAAIDNDLO 18 0 

PSWQEARAPE TWFEFTPHHA GFSALGAPVS ITHPGSKPKK GRLVRTHPER DLTPLRGLWG 240 

SAIiGNTEVIR BYIFDQLRNL TLKGLWRRAV ANAKSIGHLI PARLLRLQES SQGEHPPPED 300 

3D EGGEPEHTWL TEMLENWTRT SLEKQEQPHE DPERKGSLSN LMDPVKKTGI CASKWEWGTT 360 

HNFLYKHGGI RDKIMSSRKH LHLVDAGIAI NTPPPLVLPP TREVHLILSP DFSAGDPFET 420 

IRATTDYCRR HKIPPPQVEE AELDLWSKAP ASCYILKGET GPWIHPPLP NIDACGGDIE 480 

AWSDTYDTFK LADTYTLDW VLLLAIAKKN VRENKKKILR EIMNVAGLYY PKDSARSCCI* 540 



Seq ID NO: 94 DNA sequence 
Nucleic Acid Accession #: AK027351 



Coding sequence: 1-642 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

III III 

AGGGAAAAAA ACTCCATTAA AAAGCCCAGC TTTCCTCCAT GTTAGATGTG ACTTGGAAAA 60 

TGAGAAAGAT TTAGCAAAAT TCCACCGTAT CTTTTGCCAG 6CTAGAGACA GGGAGAGCAG 12 0 

/U AGTAAAACCC TCAGGCTGCT GAAATTTCTA GGCTCTTAGG AAGCCCCTCG AATTCTGTGA 180 

AAATGAGGGT TTCTTAACTC ACACTGAGAG CGGAAAGGGG CAGACCCTTT TCATAACTCC 24 0 

CTCAAGTGTG TGTTACCTTT CTTTACCAGC ATGGTAAGCA ACAGGACATA TCCCAGCCTC 300 

GGACATGTCT GTATGATCCA AGGTACCCAA AGTCAGACAG AGTAAACTCA AGCCTGGCAC 360 

TGGCTTTCTG CCGCTTCATG TGCTTTGGAA AAAGCAGGAG AAGCAATAGC AGCAGGAGTC 420 

/3 CCCAGCAGCT GGAGCCGCAA GAATGAACTG CAAAGAGGGA ACTGACAGCA GCTGCGGCTG 480 

CAGGGGCAAC GACGAGAAGA AGATGTTGAA GTGTGTGGTG GTGGGG6ACG GTCCCGTGGG 540 
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GAAAACCTGC CTGCTGATGA GCTACGCCAA CGACGCCTTC CCAGAGGAAT ACGTGCCCAC 600 
TGTGTTTGAC CACTATGCAG TTACTGTGAC TGTGGGAGGC AAGCAACACT TGCTCGGACT 660 
GTATGACACC GCGGGACAGG AGGACTACAA CCAGCTGAGG CCACTCTCCT ACCCCAACAC 720 
GGATGTGTTT TTGATCTGCT TCTCTGTCGT AAACCCTGCC TCTTACCACA ATGTCCAGGA 780 
5 GGAATGGGTC CCCGAGCTCA AGGACTGCAT GCCTCACGTG CCTTATGTCC TCATAGGGAC 840 
CCAGATTGAT CTCCGTGATG ACCCAAAAAC CTTGGCCCGT TTGCTGTATA TGAAAGAGAA 900 
ACCTCTCACT TACGAGCATG GTGTGAAGCT CGCAAAAGCG ATCGGAGCAC AGTGCTACTT 960 
GGAATGTTCA GCTCTGACTC AGAAAGGTCT CAAAGCGGTT TTTGATGAAG CAATCCTCAC 1020 
CATTTTCCAC CCCAAGAAAA AGAAGAAACG CTGTTCTGAG GGTCACAGCT GCTGTTCAAT 1080 
10 TATCTGAGGT TGTCTGGGAC CTGCCTCCAC CCCATCCAGG GATGAGAATG GCAGCCAATC 1140 
TCTGTGGCCA AGCTCCAGCC AAAAAGGAGG GCACGACCAG AAAGGAACTC CCTTTGCACG 1200 
GAGGCTTGCC CCATCACCCT CTGAGCCCTC CCAACACAGC ACACTAGTCA GCCCACTGCC 1260 
ACGACCTCCC TGCCAGCCAG AAGCATCCGT ACTGCACGCT GTCTGAGAAT GCTGGGCCTG 1320 
GATTGCAGAC AGTGCCGCTG CTGATCGCAT CAAAAACAAA GTCAAAGGCC ATCTCACATT 1380 
15 TTACAAATCC CCAGCTCATG AACGTGAAGC TGATAGGAAA TCACCCCAGG GAACCCGAAA 1440 
AAGAAACTTG ATTCCTCTAT TGCTGGCCTT ACTTGATGTC TTTTATAAAA CTTGGGACTA 1500 
CAATACTAAC CTTTTTTTCT GAATCTGCTG TTCTACCCAT GTGTCTCACA TTCATTTGTA 1560 
TTATTTCAAG AAATGTACTA ATTTCCAGTT CACTCAGGCC TTACTAATCC ATACCAAATT 1620 
AGCCTAAAGA CAAGGCATTT TATATTCATT TCTATTTTCA GCAT6TTTCT ACCAAAGCTA 1680 
20 TTAGAACCAA CACGTACCTC TGAATGCCCG ATTATAAGAA GACATGAGAA GACTTTAAAA 1740 
GTTTTGGAAA TTTACAGAGC CATGATTTTT GAACCTAATT GAAAGAAAAC CATCTGAATT 1800 
GTTGCAGGTC CACATTTTTG CCAAAGATAC ACTCTATAGA TGCTTAGTAG TGGCCTGATT 1860 
TOTTTCCATG TATTGCCACG ACTVAACTAAA AATGAACTGT GTTTAAGAAT GTAGTATTTC 192 0 
TGTTTTTCAT CCAAGTTGAT TGGGGGAAGA ATATGGCAGG ATCCATCTTT TACAGTATTT 198 0 
^5 TGTATTCAGT AAAGTGGACA TTCCTGCTCC TCCCTTCCCC CATTGCATGC CCTCTTCCTC 204 0 
• CCTTGATTTC ACTTTCTCTC ATGCCCGGAT CCTTTTATTC TCCCCAGTTA TAACCCAGTT 2100 
ATAAAAGAAA GATCTGAGCA TAAAGATACG TGTTTAAAAA TAACTAAAAG TAAAGGAAAG 2160 
TGCCTTAATT TTTCTATTTG CTTCAACTGA AAGTGCTTCT CAGCTCGCCC CATGTAAGTT 2220 
CTCATTCCAT GTAAATGACA TTTTCCAGTT ACAACTGGTA CTGAGATTTT GCCTCTCTCT 2280 
30 TTCCTTACTC ATCCTCCCAA ATGTCTTTGT GGGAGCCATA TCAGTGGATA CCAAGCTCTG 2340 
TATCCATTTG TCCCCTGCCC TCCACAATGT GTGACATAGA ACAGGGACTT TGGCCCTGGG 2400 
AAAGCAAAAG CTCCCAGTAA GGAATCCTGT GCCCAATGAT GTAAAACAAT TCCAAACATC 2460 
CAGGAATTTT TGTATCATAG AGCGAATTAC TTCCTATCTT TTCATTAGAG GCTATGAGGA 2520 
CTTCTAATTA GTCTTAGTTG CTTATAAGTG CCCTGGAATC ACCCAGGTAG GCACTTAATT 2580 
35 TTTTTTTCAG TTGCATGAGC AAAGTGCTTC TTAGTAGTGT GAAATTACAA CAACTTTAAG 2640 
ACTTTCCAGA TTCAAGCTCC CACTGTTGGA AAAAGCCAGC CTTTCTAATC TCTTCTGCTA 2700 
CTGGAATAAG CACTTAAGAA TTGCGTGATA GCCAGGCACC GTGGCTCATG CCTGTAATCC 2760 
CAACACTTAG GGAGGCTGAG GTGGGTGGGC CGCTTGAGCT CAGGAGTTCA AGACCAGCCT 2820 
GGGTAATATA GTGAGATCCT GTGTCTCTAT AAAAAAATTA AAAATTAGTC AGTTGTAGTG 2880 
40 ACACATACCT GTAGTCCCAG CTACTCAGGA GGCTGAGGTG GAAGGATCAC TTGAGCCCAG 2940 
AAGGTAAGGC TGCAGTGAGC TGTGACTGT6 CCACTACACT CCAGCCTGAG TGACAGAGAA 3 000 
AGAACCTGTC AAAAAAAAAA AAAAAACAAC CTACATTTCA AGTACTATTT CCCTTCTCTC 3060 
CCATCTAATT GCTAAAGATT TTCTTTCATA CGCACACACT CCAGTGACTG GAAAAACGGG 312 0 
AGTTTTCAGT CAAAGCTTGA CATTTAGAGA AAACAAGGAC TTTCTGCCTT TATAAATGGA 318 0 
45 AATGAACTGT GTATGAACTA TAACTCTGCA GAGGTTATGA ATTCATCCTT TACAAACAAT 3240 
AATGAACTTT TAGTCCTGTA ATAAATGAAA TGTTATTAGG CAGCTTTGTT GCATGATTGC 3300 
ATAGTTATAT CTTGCTAACG GGCCACTCAT TTCTCACTGA TGTGGATGAA AAAATGAGAG 3360 
CAGTATGTTT CCAGGTGTGT GCACTCAACA GGCAAATAGC TCCCGAGGTC ACCACTTCCC 3420 
TAATGGGCCA CAGGAAGTAA GTTGATCTTG ATGGGGAGAT CACGTCACCC AGAACCAGCA 34 80 
50 ACTGGATAGA GACTGTTGTT AGTGTCTGGG TAGAGCACAG GCTCCCAGGG GTCTTAAGAG 3540 
CTAATTACTG AATAAAACAA TCTAGAACAA AGCAA 



Seq ID No: 95 Protein aeouence; 
Protein Accession #: CAC06611.1 

55 

1 XI 21 

I I I 

MNCKEGTDSS CGCRGNDBKK MLKCVWGDG 
60 TVTVGGKQHIi LGLYDTAGQE DVNQLRPLSY 
DCMPHVPYVL IGTQlDIiRDD PKni2UttiLYM 
KGIiKAVPDEA ILTIFHPKKK KKRCSB6HSC 



31 41 51 

I I I 

AVGKTCXiLMS YANDAFPEEY VPTVFDHYAV 60 
PNTDVFIilCF SWNPASYHN VQEEWVPELK 120 
KEKPIiTYEHG VKLAKAIGAQ CYLECSALTQ 180 
CSII 



65 Seq ID NO: 96 DMA sequence 

Nucleic Acid Accession #: NM_003654.1 

Coding sequence: 367-1602 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

70 1 1 1 I I I 

GGGGAGGGCG CGGGAGGCGG AGGATGCCGC CGCGGCTGCT GCCGCCGCCX3 CCACCCGCGG 60 

GTCCCCGGCG ACCCTACTCC AGACCCGAGG ATGGAGCCGG CGCTGGGCGC TGCAGCrrGCT 120 

CCCGGCGCGT CCCCGACCAG GTAGCTGGTG TCACTTCGGT GTGGTTGGAA GAAGACTTTC 180 

TCCCCAGCTG CATTCCCGGA GGCGCCCTTT CGACCTGGAG GCCGGGTCTG CTGGCCACAG 240 

75 GGCTGCCGCA CTGGCTGGGA CTGCCAGCTG 6GCCTGGAGA OGCTGGTGGC TGTGGACTCC 300 

CCAGCTTGGA GCAGTCCCTC TTTGACCTCA CCCCTTGGAG AAGCAGCCCC ATGAAGGTGC 360 
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CCAGCC ATG C AATGTTCCTG GAAGGCCGTC 
CAGTACACGG CCATCCGCAC CTTCACCGCC 
GAGGCCGGGC TGGCCGAGCG ACTGTGCGAG 
CGCAAGACCC ACATCCTCAT CCTGGCCACC 
5 CTCTTCAACC AGCACCTGGA CGTCTTCTAC 
ACGCTCATCC CCCGCTTCAC CCAGGGCAAG 
GCCAGCCGCG ACCTCCTGCG GAGCCTCTAC 
ATCAAGCCGC CGCCGGTCAA CCACACCACC 
GTCCTCTGCT CCCGGCCTGT GTGCGACCCT 
10 GGGGACT6TG TGCGCAAGTG CGG6CTACTC 
GAGCGCAGCC ACX3TGGCCAT CAAGACG6TG 
CTGGTGGAAG ACCCGCGATT AAACCTCAAG 
ATTCTGGCTT CGCGCAGCGA GACCTTCCGC 
GGCACCGGGA GGAAACCCTA CAACCTGGAC 
15 TTCTCCAACT CCX3TGTCCAC CGGCCTCATG 
TTGGTGCGCT ACGAGGACCT GGCTCGGAAC 
TTCCTGGGCA TCCCGCTGGA CAGCCACGTG 
GACCCCACCC TGGGC7\AGCA CAAATACGGC 
AAGTGGCGCT TCCGCCTCTC CTACGACATC 

20 GTGCTGGCCC AGCTGGGCTA CAAGATCGCC 
GTCAGCCTGG TGGAGGAGCG GGACTTCCGC 
GGGCGGGAGG CGCAAGGTGT CGGTTTTGAT 
TAACCCCTCC CTCTCCCACC TCATCTTCGT 
CTTCTGCCCC TTTTTTGTCT CTGAAATTTG 

25 AGAGGGCGCC TGAAGTAGGG TCCCGCCCCC 
TCTCTGTGCG GACGGTGACA ATGTTTACAA 
CGGGCACTCG CGAGGCGACT TCTCAAGCTT 
TTTGCACTGT CTTACTATTC AAGGTAAGAG 
TTTATGAATG GTGTCCATCC TTTCCCCATC 

30 CCCTTAGAGC AGCGAAACTG CCCCCTCCTG 
TTACTGTGAG GTGAACGTG6 ACCT6TTTCT 
GTCTGAGTCT CGTGGCCGCC CCTGGACCAG 
TTGATCTCGG GGTCCATCTG TGATATTTCT 
CAGTTTGCTA AATGAACATT GAAATTGAAA 

35 AGTGCAATAA TCACC 



CTCCTCCTTG CCCTGGCCTC CATTGCCATC 420 

AAGTCCTTTC ACACCTGCCC CGG6CTGGCA 480 

GAGAGCCCCA CCTTCGCCTA CAACCTCTCC 54 0 

ACGCGCAGCG GCTCCTCCTT CGTGGGCCAG 600 

CTGTTTGAGC CCCTCTACCA CGTCCAGAAC 660 

AGCCCGGCCG ACCGGCGGGT CATGCTAGGC 720 

GACTGCGACC TCTACTTCCT GGAGAACTAC 780 

GACAGGATCT TCCGCCGCGG GGCCAGCCGG 84 0 

CCGGGGCCAG CCGACCTGGT CCTGGAGGAG 900 

AACCTGACCG TGGCGGCCGA GGCGTGCCGC 960 

CGCGTGCCCG AGGTGAACGA CCTGCGCGCC 1020 

GTCATCCAGC TGGTCCGAGA CCCCCGCGGC 1080 

GACACGTACC GGCTCTGGCG GCTCTGGTAC 114 0 

GTGACGCAGC TGACCACGGT GTGCGAGGAC 1200 

CGGCCCCCGT GGCTCAAGGG CAAGTACATG 1260 

CCTATGAAGA AGACCGAGGA GATCTACGGG 1320 

GCCCGCTGGA TCCAGAACAA CACGCGGGGC 1380 

ACCGTGCGAA ACTCGGCGGC CACGGCCGAG 1440 

GTGGCCTTTG CCCAGAACGC CTGCCAGCAG 1500 

GCCTCGGAGG AGGAGCTGAA GAACCCCTCG 1560 

CCCTTCTCGT_GACCCGGGCG GTGCGGGTGG 1620 

AAAATGGACC GTTTTTAACT GTTGCCTTAT 1680 

GTCCTTCCTG CCCCCAGCTC ACCCCACTCC 1740 

CACTACGTCT TGGACGGGAA TCACTGGGGC 1800 

CCCACCCCAT TCAGACACAT GGATGTTGGG 1860 

GCACCACATT TACACATCCA CACACGCACA 1920 

TTGAATGGGT GAGTGGTCGG GTATCTAGTT 1980 

GATACAAACA AGAGGACCAC TTGTCTCTAA 204 0 

CCTGCCTCCT GCCCCTGACG CCCATTTCCC 2100 

CCCGCCCTTG CCTGTCGGTG AGGCAGGTTT 2160 

GTTTCCAGTC TGTGGTGATG CTGTCTGTCT 2220 

TGATGACTGA TGAATCTTAT GAGCTTCTGA 228 0 

TTGTGCCAAA AAGAAAAAAA AAGAGTGGAT 2340 

TGCTTTATCT GTGTTTTCTG TAAATAAAAG 2400 



Seq ID No: 97 Protein sequence ; 
Protein Accession #: NP_003645.1 

40 

.1 11 21 

I I I 

MQCSWKAVLL LALASIAIQY TAIRTPTAKS 
THILILATTR SGSSFVGQLF NQHLDVFYLF 
45 RDLLRSIiYDC DLYFLENYIK PPPVNHTTDR 
CVRKCGLLNL TVAAEACRER SHVAIKTVRV 
ASRSBTFRDT YRLWRLWYGT GRKPYNLDVT 
RYBDLARNPM KKTEEIYGFL GIPLDSHVAR 
RFRLSYDIVA PAQNACQQVL AQIiGYKIAAS 



31 41 51 

I I I 

FHTCPGLAEA GLAERLCEES PTFAYNLSRK 60 

EPLYHVQNTL IPRFTQGKSP ADRRVMLGAS 12 0 

IFRRGT^RVL CSRPVCDPPG PADLVtiEEGD 180 

PEVNDLRAIiV EDPRLNLKVI QIiVRDPRGIL 24 0 

QliTTVCEDFS NSVSTGLMRP PWLKGKYMLV 300 

WIQNNTRGDP TLGKHKYGTV RNSAATAEKW 360 
EEBLKNPSVS LVEERDFRPF S 



55 



60 



65 



70 



75 



Seq ID NO: 98 DNA aecaience 

Nucleic Acid Accession #: NM_002852.1 

Coding sequence: 68-1213 {underlined sequences correspond to start 



51 



41 

I I 
TGGAAAGAAC TTTGCGTCTC 
TGGTCTGCAG TGTTGGCCGA 
AACGAAATAG ACAATGGACT 
GAGCACTCGG AATGGGACAA 
ATGCTGCTGC AAGCCACGGA 



1 11 21 31 

I I I I 

CTCAAACTCA GCTCACTTGA GAGTCTCCTC CCGCCAGCTG 
TCCAGC AATG CATCTCCTTG CGATTCTGTT TTGTGCTCTC 
GAACTCGGAT GATTATGATC TCATGTATGT GAATTTGGAC 
CCATCCCACT GAGGACCCCA CGCCGTGCGA CTGCGGTCAG 

GCTCTTCATC ATGCTGGAGA ACTCGCAGAT GAGAGAGCGC 

CGACGTCCTG CGGGGCGAGC TGCAGAGGCT GCGGGAGGAG CTGGGCCGGC TCGCGGAAAG 
CCTGGCGAGG CCGTGCGCGC CGGGGGCTCC CGCAGAGGCC AGGCTGACCA GTGCTCTGGA 
CGAGCTGCTG CAGGCGACCC GCGACGCX;GG CCGCAGGCTG GCGCGTATGG AGGGCGCGGA 

ggcgcagcgc ccagaggagg cggggcx3cgc cctggccgcg gtgctagagg agctgcggca 
gacgcgagcc gacctgcacg cggtgcaggg ctgggctgcc cggagctggc tgccggcagg 
ttgt6aaaca gctattttat tcccaatgcg ttccaagaag atttttggaa gcgtgcatcc 
agtgagacca atgaggcttg agtcttttag tgcctgcatt tgggtcaaag ccacagatgt 
attaaacaaa accatcctgt tttcctatgg cacaaagagg aatccatatg aaatccagct 
gtatctcagc taccaatcca tagtgtttgt ggtgggtgga gaggagaaca aactggttgc 
tgaagccatg gtttccctgg qaaggtggac ccacct6tgc ggcacct6ga attcagagga 
agggctcaca tccttgtggg taaatggtga actg6cggct accactgttg a6atggccac 
aggtcacatt gttcctgagg 6aggaatcct 6cagattggc caagaaaaga atggctgctg 
tgtgggtggt ggctttgatg aaacattagc cttctctggg agactcacag gcttcaatat 
ctgggatagt gttcttagca atgaagagat aagagagacx: gga6ga6cag agtcttgtca 



and stop codons) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



226 



NSDOCIO: <WO 02079492A2_L> 



wo 02/079492 



PCT/US02/04915 



CATCCGGGGG AATATTGTTG GGTGGGGAGT 
GTATGTTTCA TAAATGTTGT GAAACTCC31C 
AACACATGCC AGTTGGGAAG GTCTGAAAAC 
TGAAAGAGAG AGTTGAGACC AATCTTTATT 
5 AAGGAAAGAC ATTGGAAAAA GCTTTTGAGG 
TTTCAGTTTA ATGCTGTGTC TCTGTCAGAT 
TGTTGAACAG AGGGACAATT GTTTTACTTT 
GAATTTTACA TTGGAAGAAT AACAAAATAA 
TATGTACCTT ATTACAAAAA AAATGATGAA 
10 ACTATAAATG TAGTTTATGT GTTATAATCG 
AAGTTATATT GCAAAAGGGA TTTGTATTAA 
AAATAAAATA TTTTATAAAA CTAAAAAAAA 



CACAGAGATC CAGCCACATG GAGGAGCTCA 1200 

TTGAAGCCAA AGAAAGAAAC TCACACTTAA 1260 

TCAGTGCATA ATAGGAACAC TTGAGACTAA 1320 

TGTACTGGCC AAATACTGAA TAAACAGTTG 1380 

ATAATGTTAC TAGACTTTAT GCCATGGTGC 1440 

AAACTCTCAA ATAATTAAAA AGGACTGTAT 1500 

TCTTTGGTTA ATTTTGTTTT GGCCAGAGAT 1560 

GATTTGTTGT CCATTGTTCA TTGTTATTGG 162 0 

AACATATTTA TACTACAAGG TGACTTAACA 16 8 0 

AATGTCACGT TTTTGAGJ\AG ATAGTCATAT 1740 

TTTAAGACTA TTTTTGTAAA GCTCTACTGT 1800 
AAAAAAA 



15 Seq ID No: 99 Protein aeouence: 
Protein Accession #: NP_002B43.1 

1 11 21 31 41 51 

I I ' ' ' ' 

20 MHIiIiAILFCA LWSAVIiAENS DDYDIiMYVNL DNEIDNGIiHP TBDPTPCDCG QEHSEWDKIjP 60 

IMLENSQHRE RMLLC2ATDDV LRGBIjQRLRB BLGRIiAESLA RFCAPGAFAB ARLTSAIiDEL 120 

LQATRDAGRR LARMEGAEAQ RPEEAGRAIiA AVL.EELRQTR ADLHAVQGWA ARSWLPAGCE 180 

TAIIiFPMRSK KIFGSVHPVR PMRIiESFSAC IWVKATDVLN KTIIiFSYGTK RNPYEIQLYIj 240 

SYQSIVFWG GEENKLVAEA MVSLGRWTHIi CGTWNSEEGIi TSLWVNGELA ATTVEMATGH 300 

25 IVPEGGILQI GQEKNGCCVG GGFDETLAPS GRLTGFNIWD SVLSNEEIRE TGGAESCHIR 360 

GNIVGWGVTE IQPHGGAQYV S 

Seq ID NO; 100 DNA sequence 
Nucleic Acid Accession #: NM_007351.l 
30 Coding sequence; 72-3758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

35 I I I i I I 

CTGCTATCAA AAAGGCCATA AGGATTTTGT CCCCAAATTT CACATGAGCT ACCTTGCTTC 60 

AAACTACTGA GATGA AGGGG GCAAGATTAT TTGTCCTTCT TTCTAGTTTA TGGAGTGGGG 120 

GCATTGGGCT TAACAACAGT AAGCATTCTT QGACTATACC TGAGGAT6GG AACTCTCAGA 180 

AGACTATGCC TTCTGCTTCA GTTCCTCCAA ATAAAATACA AAGTTTGCAA ATACTGCCAA 24 0 

40 CCACTCGGGT CATGTCGGCG GAGATAGCTA CAACTCCAGA GGCAAGAACT TCTGAAGACA 300 

GTCTTCTTAA ATCAACACTG CCTCCCTCAG AAACT^GTGC ACCTGCTGAG GGTGTGAGAA 360 

ATCAAACTCT CACATCCACA GAGAAAGCAG AAGGAGTGGT CAAGTTACAG AATCTTACCC 42 0 

TCCCAACCAA CGCTAGCATC AAGTTCAATC CTGGAGCAGA ATCAGTGGTC CTTTCCAATT 4 80 

CTACACTGAA ATTTCTTCAG AGCTTTGCCA GAAAGTCAAA TGAACAAGCA ACTTCTCTAA 540 

45 ACACAGTTGG AGGCACTGGA GGCATTGGAG GCGTTGGAGG CACTGGAGGC GTGGGAAATC 600 

GAGCCCCACG GGAAACATAC CTCAGCCGGG GTGACAGCAG TTCCAGCCAA AGAACTGACT 660 

ACCAAAAATC AAATTTCGAA ACAACTAGAG GAAAGAATTG GTGTGCTTAT GTACATACCA 720 

GGTTATCTCC CACAGTGACA TTGGACAACC AGGTCACTTA TGTCCCAGGT GGGAAAGGAC 780 

CTTGTGGCTG GACCGGTGGA TCCTGTCCTC AGAGATCTCA GAAGATATCC AATCCTGTCT 840 

50 ATAGGATGCA ACATAAAATT GTCACCTCAT TGGATTGGAG GTGCTGTCCT GGATACAGTG 900 

GGCCGAAATG TCAACTAAGA GCCCAGGAAC AGCAAAGTTT GATACACACC AACCAGGCTG 960 

AAAGTCATAC AGCTGTTGGC AGAGGAGTAG CTGAGCAGCA GCA6CAGCAA G6CTGTGGTG 1020 

ACCCAGAAGT GATGCAAAAA ATGACT6ATC AGGTGAACTA CCAGGCAATG AAACTGACTC 1080 

TTCTGCAGAA GAAGATTGAC AATATTTCTT TGACTGTGAA TGATGTAAGG AACACTTACT 1140 

55 CCTCCCTAGA AGGAAAAGTC AGCGAAGATA AAAGCAGAGA ATTTCAATCT CTTCTAAAAG 1200 

GTCTAAAATC CAAAAGCATT AATGTACTTGA TAAGAGACAT AGTAAGAGAA CAATTTAAAA 1260 

TTTTTCAAAA TGACATGCAA GAGACTGTAG CACAGCTCTT CAAGACTGTA TCAAGTCTAT 1320 

CAGAGGACCT CGAAAGCACC AGGCAAATAA TTCAAAAAGT TAATGAATCT GTGGTTTCAA 1380 

TAGCAGCCCA GCAAAAGTTT GTTTTGGTGC AAGAGAATC6 GCCCACTTTG ACTGATATAG 144 0 

60 TGGAACTAAG GAATCACATT GTGAATGTAA GGCAAGAAAT GACTCTTACA TGTGAGAAGC 1500 

CTATTAAAGA ACTAGAAGTA AAGCAGACTC ATTTAGAAGG TGCTCTAGAA CAGGAACACT 1560 

CAAGAAGCAT TCTGTATTAT GAATCCCTCA ATAAAACTCT TTCTAAATTG AAGGAAGTAC 1620 

ATGAGCAGCT TTTATCAACT GAACAGGTAT CAGACCAGAA GAATGCTCCA GCTGCTGAGT 1680 

CAGTTAGCAA TAATGTCACT GAGTACATGT CTACTTTACA TGAAAATATA AAGAAGCAGA 174 0 

65 GTTTGATGAT GCTGCAAATG TTTGAAGATT TGCACATTCA AGAAAGCAAG ATTAACAATC 1800 

TCACCGTCTC TTTGGAGATG GAGAAAGAGT CTCTCAGAGG TGAATGTGAA GACATGTTAT 1860 

CCAAATGCAG AAATGATTTT AAATTTCAAC TTAAGGACAC AGAAGAGAAT TTACATGTGT 1920 

TAAATCAAAC ATTGGCTGAA GTTCTCTTTC CAATGGACAA TAAGATGGAC AAAATGAGTG 1980 

AGCAACTAAA TGATTTGACT TATGATATGG AGATCCTTCA ACCCTT6CTT GAGCAGGGAG 2040 

70 CATCACTCAG ACAGACAATG ACATATGAAC AACCAAAGGA AGCMTAGTG ATAAGGAAAA 2100 

AGATAGAAAA TCTGACTAGT GCTGTCAATA GTCTAAATTT TATTATCAAA GAACTTACAA 2160 

AAAGACACAA CTTACTTAGA AATGAAGTAC AGGGTCGTGA TGATGCCTTA GAAAGACGTA 2220 

TCAATGAATA TGCCTTAGAA ATGGAAGATG GCCTCAATAA GACAAT6ACT ATTATAAATA 2280 

ATGCTATT6A TTTCATTCAA 6ATAACTATG CCCTAAAAGA GACTTTAAGT ACTATTAAG6 2340 

75 ATAATAGTGA GATCCATCAT AAATGTACCT CCGATATGGA AACTATTTTG ACATTTATTC 2400 

CTCAGTTCCA CCGTCTGAAT 6ATTCTATTC AGACTTT6GT CAATGACAAT CAGAGATATA 2460 



227 



wo 02/079492 



PCT/LS02/049I5 



10 



15 



20 



25 



30 



ACTTTGTTTT 

AGTCCAACTT 

ACCAGCAAAA 

ATTTTGAGAC 

ATATTTCAGT 

AAGTATTAAA 

TCTTTT06CT 

GTGTGTCAGA 

AACTTCTTCA 

CTGCCCTATC 

TTGTCAAGTC 

CAACGGTAAA 

TATATCCTGA 

TAAATGGAAG 

CTATCAAGCT 

ATGCACCCAT 

TCCTGTTTAA 

TTAGAATTCC 

ATATTTCTGG 

TTAACAGTGA 

ATGGGCAGGA 

TTACTACATT 

CACCTTTATT 

TTGGTTTTTC 

TCTATTTTAT 

CTAAAGAAAT 

TTATTTCTTC 

ATATTATCAG 

TAAATATATA 



GCAAGTCGCC 

CCAAAAGATG 

TATGAGTCAT 

TCGGTTGCAA 

TAAAAAAGGC 

TTCCAGATTT 

TAACAAAACT 

ACTGAATGCT 

GAAA6GTCTA 

TAATTCAACT 

TCAGAAGCAA 

TCTTACCACA 

GGAGTATTCA 

AACT AG CTTT 

TGTGGAAGAA 

GGTGGCATTT 

TAACTTGGAT 

GTATCTTGGA 

ATTTTTAGTG 

AATACACTGT 

AGTCTGGTTA 

TAGTGGCTAT 

GAGAAACAGC 

TACAGGAAAT 

AAAATTATTT 

TTAGTGGCAC 

ATTTTAAGTC 

TCACAGTTTT 

ACACACATTT 



AAGACCCTTG 

TATCAAATGT 

TTGGAAGAAA 

GACATTGAGT 

AGTGTAGTTA 

AAGGCX5TTGG 

CTCCACGAAG 

ACCATCCCTA 

ACAGAATTT6 

TGTTGTATAG 

GTAAAATCAT 

GTCCTGATAG 

AGCTGTA6TC 

ACCTGTGCCT 

AATGCTTTAG 

TTTGCATCTC 

GTCAATTATG 

GTATATGTTT 

GTTGATGGAA 

GATAGGGTTT 

CGACTTGCAA 

TTATTATATC 

CAGTGTTTTC 

GAAAATCAAC 

GAATATTGTT 

AGAAAACAAA 

ATTGCAATGG 

CTTTCCAATT 

TCTAGATTCA 



CAGGTATTCC 

TCAATGAAAC 

AACTACTCTT 

CTAAAGTTAC 

CAAATGAGAG 

AAGCAAAATC 

TTTTAACAAT 

AGTGGATAAA 

TGGAACCAAT 

ATCGATCGTT 

TGCCAAAGAA 

GCC6GACTCA 

G6CATCCGTG 

GCAGACATCC 

CTCCAGATTT 

ATAC6TAT6G 

GAGCTTCATA 

TCAAGTACAC 

TAGACAAGCT 

TAACTGGGGA 

AAGGAACAAT 

GTAC ATAAG T 

ATTTATCTTT 

TTGTTTTTTT 

TAATGTCTGA 

GTGAATTTGT 

AAAGTAATAT 

AAACACTTAA 

CAAATTTAAA 



CAGAGATGAG 

CACTTCCCAA 

AACTACCAAG 

CCAGACGCTC 

AGATCAGGCT 

TATCCATCTT 

GTGTCACAAT 

ACATTCCCTG 

AATTCAAATA 

GCCTGGTAGT 

AATTAACGCA 

AAGAAACACG 

CCAAAATGGG 

TTTTACTGGT 

TTCCAAAG6A 

AATGACTATA 

TACCCCAAGA 

CATCGAGTCA 

TGCATTTGAG 

TGCCTTATTA 

TCCAGCCAAG 

TAGTATGAAA 

GCTTGCACAT 

AATATGAGTA 

ATATGAAAGA 

TAGCATAATT 

TATAAAACGG 

CTTTTGTTAT 

TAAATTACTC 



AAACTAAATC 

GTGAGAAAAT 

ATTTCCAAAA 

ATACCTTATT 

CTTCAACTGC 

TCAATTAACT 

GCTTCTACAA 

CCAGATATTC 

AAAACTCAAG 

CTGGCAAATG 

CTTAAGAAAC 

GACAACATAA 

GGCACGTGCA 

GACAACTGCA 

TCTTACAGAT 

CCTGGTCCTA 

ACTGGAAAAT 

TTTAGTGCTC 

TCTGAAAATA 

GAATTAAATT 

TTTCCCCCTG 

AACAGACTAT 

CTGCTCTGTT 

AACTTGTATG 

GTTCTTGATC 

ATTCCTATTC 

TAATTACAAC 

TCCCTGTATA 

AAAAAATG 



2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 



Seq ID No; 101 Protein s equence ; 
Protein Accession #: NP 031377,1 



35 



40 



45 



50 



55 



60 



MKGARhFVhL, 
MSAEIATTPE 
ASIKFNPGAE 
ETYLSRGDSS 
TGGSCPQRSQ 
AVGRGVAEQQ 
GKVSEDKSRE 
ESTRQIIQKV 
LEVKQTHLEG 
NVTEYMSTliH 
NOFKFQLKDT 
QTMTYEQPKB 
ALEMEDGWK 
RLNDSIQTLV 
MSHLEEKLLL 
SRFKALEAKS 
KGIiTEPVEPI 
LTTVIilGRTQ 
VEENAIiAPDF 
YLGVYVFKYT 
VWIiRLAKGTI 



11 
I 

SSLWSGGIGIi 
ARTSEDSIjLK 
SWLSNSTLK 
SSQRTDYQKS 
KISNPVYRMQ 
QQQGCGDPEV 
FQSIiljKGLKS 
NESWSIAAQ 
AXiEQEHSRSI 
ENIKKQSLMM 
EENIiHVIiNQT 
AIVIRKKIEN 
TMTIINNAID 
NDNQRYNFVL 
TTKISKNPET 
IHIiSINFFSL 
IQIKTQAAIiS 
RNTDNIIYPE 
SKGSYRYAPM 
XBSFSAHISG 
PAKFPPVTTF 



21 

I 

msKHsmip 

STLPPSETSA 
FLQSFARKSN 
NPETTRGKNW 
HKIVTSLDWR 
MQKMTDQVNY 
KSINVLIRDI 
QKFVIiVQENR 
LYYESLNKTL 
LQMFEDLHIQ 
LAEVLPPMDN 
LTSAVNSIiNF 
FIQDNYAIiKE 
QVAKTLAGIP 
RliQDIESKVT 
NKTLHEVLTM 
NSTCCIDRSL 
EYSSCSRKPC 
VAFFASHTYG 
FLWDGIDKIi 
SGYliLYRT 



31 
I 

EZXa^SQKTMP 
PAEGVRNQTI* 
BQATSIjNTVG 
CAYVHTRIjSP 
CCPGYSGPKC 
QAMKLTLLQK 
VREQFKIFQN 
PTIiTDIVEIiR 
SKIiKEVHEQL 
ESKINNIiTVS 
KMDKMSEQLN 
IIKELTKRHN 
TLSTIKDNSE 
RDEKLNQSHF 
QTLIPYYISV 
CHNASTSVSE 
PGSLAKWKS 
QNGGTCIN6R 
MTIPGPILFN 
AFESENINSE 



41 
I 

SASVPPNKIQ 
TSTEKAEGW 
GTGGI6GVGG 
TVTLDNQVTY 
QLRAQEQQSL 
KIDNISLTVN 
DMQETVAQLF 
NHIVNVRQEM 
LSTEQVSDQK 
IiEMEKESLRG 
DLTYDMEILQ 
LLRNEVQGRD 
IHHKCTSDME 
QKMYQMFNET 
KKGSWTNER 
UiATIPKWIK 
QKQVKSLPKK 
TSFTCACRHP 
NLDVNYGASY 
IHCDRVLTGD 



51 
I 

SIiQILPTTRV 

KLQNIiTLPTN 

TGGVGNRAPR 

VPG6KGPCGW 

IHTNQAESHT 

DVRNTYSSLE 

ICTVSSLSEDL 

TLTCEKPIKE 

NAPAAESVSN 

ECEDMLSKCR 

PLLEQGASIiR 

DALERRINEY 

TIIiTFIPQFH 

TSQVRKYQQN 

DQALQLQVLN 

HSLPOIQIiLQ 

INAIiKKPTVN 

FTGDNCTIKI* 

TPRTGKPRIP 

ALLSUOYGQE 



60 
120 
180 
240 
300 
360' 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession #: NM_000873.2 

Coding sequence: 57-884 (underlined sequences correspond to start and stop codons) 



65 



70 



75 



1 
I 

ATCTCCCTCC 
CCTCTTTCGG 
CGGATGAGAA 
GGTCCCTCGA 
CCTCTCTAAA 
ACATCTCCCA 
TGAATTCCAA 
CTTTGGTGGC 
TGGACAGCCT 
GGAAGGCAGC 
AGGATGGCCA 
ACATCTTTCA 



11 
I 

AGGCAGCCCT 
TTACAGGACC 
GGTATTCGA6 
GGTCAACTGC 
TAA6ATTCT6 
TGACACGGTC 
CGTCAGCGTG 
TGTGGGCAA6 
CACCCTCTTC 
CCCTGCTCCQ 
CCGCAACTTC 
CAAACACTCA 



21 

I 

TGGCTGGTCC 
CTGACTGTGG 
GTACACGTGA 
AGCACCACCT 
CTGGACGAAC 
CTCCAATGCC 
TACCAGCCTC 
TCCTTCACCA 
CTGTTCCXjTG 
CAGGAGGCCA 
TCCTGCCTGG 
GCCCCGAAGA 



31 
I 

CTGCGAGCCC 
CCCTCTTCAC 
GGCCAAAGAA 
GTAACCAGCC 
A6GCTCAGTG 
ACTTCACCTG 
CAAGGCAGGT 
TTGAGTGCAG 
GCAATGAGAC 
CAGCCACATT 
CTGTGCTGGA 
T6TTGGAGAT 



41 

I 

GTGGAGACTG 
CCTGATCTGC 
GCTGGCGGTT 
TGAAGTGGGT 
6AAACATTAC 
CTCCGGGAAG 
CATCCTGACA 
GGTGCCCACC 
TCTOCACTAT 
CAACA6CAC6 
CTTGATGTCT 
CTATGAGCCT 



51 
I 

CCAGAGATGT 
TGTCCAGGAT 
GAGCCCAAAG 
GGTCTGGAGA 
TTGGTCTCAA 
CAGGAGTCAA 
CTGCAACCCA 
GTGGAGCCCC 
GAGACCTTCG 
GCTGACAGA6 
C6CG6TGGCA 
GTGTCGGACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



228 



INSOOCIO: <WO 02079492A2.r > 



wo (12/079492 



PCT/US02/(M915 



10 



GCCAGATGGT CATCATAGTC ACGGTGGTGT CGGTGTTGCT GTCCCTGTTC GTGACATCTG 780 
TCCTGCTCTG CTTCATCTTC GGCCAGCACT TGCGCCAGCA GCGGATGGGC ACCTACGGGG 84 0 
TGCGAGCGGC TTGGAGGAGG CTGCCCCAGG CCTTCCGGCC ATAGCAACCA TGAGTGGCAT 900 
GGCCACCACC ACGGTGGTCA CTGGAACTCA GTGTGACTCC TCAGGGTTGA GGTCCAGCCC 960 
TG6CTGAAGG ACTGTGACAG GCAGCAGAGA CTTGGGACAT TGCCTTTTCT AGCCCGAATA 1020 
CAAACACCTG GACTT 

Seq ID No: 103 Protein aecruence; 
Protein Accession #: NP 000864.1 



1 11 21 31 41 51 

I I I I I i 

MSSFGYRTLT VALFTIiICCP GSDEKVFEVH VRPKKIiAVEP KGSLEVNCST TCNQPEVGGIj 60 

15 ETSLNKILLD EQAQWKHYLV SNISHDTVLQ CHFTCSGKQE SMNSNVSVYQ PPRQVILTLQ 120 

PTIjVAVGKSF TIECRVPTVE PIiDSIiTLFXiF RGNBTLHYET FGKAAPAPQE ATATFNSTAD 180 

redghrnfsc lavldlmsrg gnifhkhsap kmlbiyepvs dsqmviivtv vsviilislfvt 240 
svllcfifgq hlrqqrmgty gvraawrrlp qaprp 

20 Seq ID NO: 104 DNA sequence 

Nucleic Acid Accession #: NM_001795.2 

Coding sequence: 121-2475 (underlined sequences correspond to seaxt and stop codons) 



25 1 11 21 31 41 51 

I I I I I I 

gacggtcggc tgacaggctc CACAGAGCTC CACTCACGCT CAGGCCCTGG ACGGACAGGC 60 

AGTCCAACGG AACAGAAACA TCCCTCAGCC CCACAGGCAC GATCTGTTCC TCCTGGGAAG 120 

ATG CAGAGGC TCATGATGCT CCTCGCCACA TCGGGCGCCT GCCTGGGCCT GCTGGCAGTG 180 

30 GCAGCAGTGG CAGCAGCAGG TGCTAACCCT GCCCAACGGG ACACCCACAG CCTGCTGCCC 240 

ACCCACCGGC GCCAAAAGAG AGATTGGATT TGGAACCAGA TGCACATTGA TGAAGAGAAA 300 

AACACCTCAC TTCCCCATCA TGTAGGCAAG ATCAAGTCAA GCGTGAGTCG CAAGAATGCC 360 

AAGTACCTGC TCAAAGGAGA ATATGTGGGC AAGGTCTTCC GGGTCGATGC AGAGACAGGA 420 

GACGTGTTCG CCATTGAGAG GCTGGACCGG GAGAATATCT CAGAGTACCA CCTCACTGCT 480 

35 GTCATTCTGG ACAAGGACAC TGGTGAAAAC CTGGAGACTC CTTCCAGCTT CACCATCAAA 540 

GTTCATGACG TGAACGACAA CTGGCCTGTG TTCACGCATC GGTTGTTCAA TGCGTCCGTG 600 

CCTGAGTCGT CGGCTGTGGG GACCTCAGTC ATCTCTGTGA CAGCAGTGGA TGCAGACGAC 660 

CCCACTGTGG GAGACCACGC CTCTGTCATG TACCAAATCC TGAAGGGGAA AGAGTATTTT 720 

GCCATCGATA ATTCTGGACG TATTATCACA ATAACX3AAAA GCTTGGACCG AGAGAAGCAG 780 

40 GCCAGGTATG AGATCGTGGT GGAAGCGCGA GATGCCCAGG GCCTCCGGGG GGACTCGGGC 840 

ACGGCCACCG TGCTGGTCAC TCTGCAAGAC ATCAATGACA ACTTCCCCTT CTTCACCCAG 900 

ACCAAGTAC3V CATTTGTCGT GCCTGAAGAC ACCCGTGTGG GCACCTCTGT GGGCTCTCTG 960 

TTTGTTGAGG ACCCAGATGA GCCCCAGAAC CGGATGACCA AGTACAGCAT CTTGCGGGGC 1020 - 

GACTACCAGG ACGCTTTCAC CATTGAGACA AACCCCGCCC ACAACGAGGG CATCATCAAG 1080 

45 CCCATGAAGC CTCTGGATTA TGAATACATC CAGCAATACA GCTTCATCGT CGAGGCCACA 114 0 

GACCCCACCA TCGACCTCCG ATACATGAGC CCTCCCGCGG GAAACAGAGC CCAGGTCATT 1200 

ATCAACATCA CAGATGTGGA CGAGCCCCCC ATTTTCCAGC AGCCTTTCTA CCACTTCCAG 1260 

CTGAAGGAAA ACCAGAAGAA GCCTCTGATT GGCACAGTGC TGGCCATGGA CCCTGATGCG 1320 

GCTAGGCATA GCATTGGATA CTCCATCCGC AGGACCAGTG ACAAGGGCCA GTTCTTCCGA 1380 

50 GTCACAAAAA AGGGGGACAT TTACAATGAG AAAGAACTGG ACAGAGAAGT CTACCCCTGG 1440 

TATAACCTGA CTGTGGAGGC CAAAGAACTG GATTCCACTG GAACCCCCAC AGGAAAAGAA 1500 

TCCATTGTGC AAGTCCACAT TGAAGTTTTG GATGAGAATG AC»ATGCCCC GGAGTTTGCC 1560 

AAGCCCTACC AGCCCA/UVGT GTGTGAGAAC GCTGTCCATG GCCAGCTGGT CCTGCAGATC 1620 

TCCGCAATAG ACAAGGACAT AACACCACGA AACGTGAAGT TCAAATTCAC CTTGAATACT 1680 

55 GAGAACAACT TTACCCTCAC GGATAATCAC GATAACACGG CCAACATCAC AGTCAAGTAT 1740 

GGGCAGTTTG ACCX^GGAGCA TACCAAG6TC CACTTCCTAC CCX3TGGTCAT CTCAGACAAT 1800 

GGGATGCCAA GTCX3CACGGG CACCAGCAC6 CTGACCGTGG CCGTGTGCAA GTGCAACGAG 1860 

CAGGGCGA6T TCACCTTCTG CGA6GATATG GCCGCCCAGG TGGGCXSTGAG CATCCAGGCA 1920 

GTGGTAGCCA TCTTACTCTG CATCCTCACC ATCACAGTGA TCACCCTGCT CATCTTCCTG 1980 

60 CGGCGGCGGC TCCGGAAGCA GGCCCGCGCG CACGGCAAGA GCGTGCCGGA GATCCACGAG 204 0 

CAGCTGGTCA CCTACGACGA GGAGGGCGGC GGCGAGATGG ACACCACCAG CTACGATGTG 2100 

TCGGTGCTCA ACTCGGTGCG CCGCGGCGGG GCCAAGCCCC CGCGGCCCGC GCTGGACGCC 2160 

CGGCCTTCCC TCTATGCGCA GGTGCAGAAG CCACCGAGGC ACGCGCCTGG GGCACACGGA 2220 

GGGCCCGGGG AGATGGCAGC CATGATCGAG GTGAAGAAGG ACGAGGCGGA CCACGACGGC 2280 

65 GACGGCCCCC CCTACGACAC GCTGCACATC TACGGCTACG AGGGCTCCGA GTCCATAGCC 234 0 

GAGTCCCTCA GCTCCCTGGG CACCGACTCA TCCGACTCTG ACGTGGATTA CGACTTCCTT 2400 

AACGACTGGG GACCCAGGTT TAAGATGCTG GCTGAGCTGT ACGGCTCGGA CCCCCGGGAG 2460 

GAGCTGCTGT ATTAGGCGGC CGAGGTCACT CTGGGCCTGG GGACCCAAAC CCCCTGCAGC 2520 

CCAGGCCAGT CAGACGCCAG GCACCACAGC CTCCAAAAAT GGCAGTGACT CCCCAGCCCA 2580 

70 GCACCCCTTC CTCGTGGGTC CCAGAGACCT CATCAGCCTT GGGATAGCAA ACTCCAGGTT 2640 

CCTGAAATAT CCAGGAATAT ATGTCAGTGA TGACTATTCT CAAATGCTGG CAAATCCAGG 2700 

CTGGTGTTCT- GTCTGGGCTC AGACATCCAC ATAACCCTGT CACCCACAGA COGCCGTCTA 2760 

ACTCAAAGAC TTCCTCTGGC TCCCCAAGGC TGCAAA6CAA AACAGACTGT GTTTAACTGC 2820 

TGCAGGGTCT TTTTCTAGGG TCCCTGAACG CCCTGGTAAG GCTGGTGAGG TCCTGGTGCC 2880 

75 TATCTGCCTG GAGGCAAAGG CCTGGAC3VGC TTGACTTGTO GGGCAG6ATT CTCTGCAGCC 2940 

CATTCCCAAG GGAGACTGAC CATCATGCCC TCTCTCXKSGA GCCCTAGCCC TGCTCCAACT 3000 
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10 



15 



20 



CCATACTCCA 
AGGAAGGGGC 
ATGCCAGTAA 
TTTQAAGCAA 
GGGTGAGGGC 
GCTTTGAGAC 
CAGAAGACGT 
TCTGTCTACT 
GACAATGTCC 
CGCTGTTGTC 
GCCCTGCCCA 
CACCTTGGAG 
CCTTCGTCAT 
AGATAACACT 
AATGATACTT 
AGCAGGTTGT 
CTGTAACCTT 
GCCAAACTGG 
GCCTGGGCAA 



CTCCAAGTGC 
CCCATGGCAG 
CTGTGCTGTA 
CTGTGAATTC 
CACCTCCACA 
TCCTCAGCAC 
CTCCCCTTCT 
CCTTATCCCT 
AAACCCACTC 
ACATCTCAGG 
ACCTCTGTGG 
AAGTGGCATC 
GGACCGAGGT 
GACTTGTTTG 
ACAAGTTTCT 
TATTTAGGTT 
CTATTTTCTA 
TGCATGACAA 
CAAAAAAA 



CCCACCACTC 
CTCCTGACCT 
CTGAGCACTG 
ATTCTGGAGG 
CCCACCCCCT 
CCCTCCAGTT 
CTGCCTCACC 
TGGTTTAGAG 
ATGACTGCAT 
GAACT6ACCC 
TCACCCATGC 
AGTCAACAGA 
TCCCACTCTG 
TTTTAACCAA 
AGCTCTCACA 
AACAATATTA 
TAATTGTAGT 
GTACTGTATT 



CCCAACCCCT 
TGGGTCCTGA 
AACCACATTC 
GGCAGTGGAG 
CTGGAGAAGG 
TTGCCTGAGA 
TGGTCGCCAA 
GAACCCAAGA 
GAC6GAGCCG 
TCAGGCACAC 
ATCTTCCACT 
GA6GGGCAGG 
GGCAAAGCCC 
TAACTAGCTT 
GACATATAGA 
ATTCAGGTTT 
AATTGCTCTA 
TTTTTATACC 



CTCCAGGCCT 
AGTGACCTCA 
AGGGAAATGG 
ATCAGGAGTG 
CCTGGAAGAG 
AGGGGCAGAT 
TCCATGCTCT 
TGTGGCCTTT 
AGCCATGTGT 
CTTGCAGAAG 
GGAACGTTTC 
GAAGGAGACA 
CTCACACTGC 
CTTATAATGA 
ATAAGGGTTT 
TTTAGTTGGA 
CAGATAATGT 
TAAATAAAGA 



GTCAAGAGGG 
CTGGCCTGCC 
CTTATTAAAC 
ACAGATCACA 
CTGAGACCTT 
GTTCCCGGAG 
CTTTCTTTTC 
AGCAAAACTG 
CTTTACACCT 
GCAAGGCCCT 
ACTGCAAACA 
CCAAGCTCAC 
AAGGGATTGT 
TTTTTTTACT 
TTGCATAATA 
AAAACAATTC 
CTATATATTG 
AAAATCTTTA 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3460 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



Seq ID No: 105 Protein aecmence : 
Protein Accession #: NP 001786.1 



25 



30 



35 



40 



MQRIiMMLIAT 
NTSLPHHV6K 
VIVDKDTGEN 
PTVGDHASVM 
TATVLVTLQD 
DYQDAFTIET 
INITDVDEPP 
VTKKGDIYNE 
KPYQPKVCEN 
GQFDREHTKV 
WAILLCILT 
SVLNSVRRGG 
DGPPYDTLHI 
EIiLY 



11 
I 

SGACLGLIiAV 
IKSSVSRKNA 
LBTPSSFTIK 
YQILKGKEYP 
INDNFPFFTQ 
NPAHKEGIIK 
IFQQPFYHFQ 
KELDREVYPW 
AVHGQIjVLQI 
HPLPWISDN 
ITVITLLIPL 
AKPPRPALDA 
YGYEGSESIA 



21 
I 

AAVAAAGANP 
KYLIiKGEYVG 
VHDVNDNWPV 
AIDNSGRZZT 
TKYTPWPBD 
PMKPIiDYEYI 
LKEUQKKPIil 
YNLTVEAKEL 
SAIDKDITPR 
GMPSRTGTST 
RRRIiRKQARA 
RPSLYAQVQK 
ESIiSSIiGTDS 



31 

I 

AQRDTHSLLP 
KVFRVDAETG 
FTHRLFNASV 
ITKSIaDRBKQ 
TRVGTSVGSL 
QQYSFIVEAT 
GTVIiAMDPDA 
DST6TPTGKE 
NVKFKFTLNT 
LTVAVCKCatlE 
HGKSVPEIHE 
PPRHAPGAHG 
SDSDVDYDFL 



41 

1 

THRRQKRDWI 
DVFAIERIiDR 
PESSAVGTSV 
ARYEIWEAR 
FVEDPDEPQM 
DPTIDIiRYMS 
ARHSI6YSIR 
SIVQVHIEVL 
ENNFTLTDNH 
QGEFTFCEDM 
QLVTYDEEGG 
GPGEMAAMZE 
NDWGPRFKML 



51 
I 

WNQMHIDEEK 
ENISEYHIiTA 
ISVTAVDADD 
DAQGLRGDSG 
RMTKYSILRG 
PPAGNRAQVI 
RTSOKGQFFR 
DENDKAPBFA 
DNTANITVKY 
AAQVGVSIQA 
GEMDTTSYDV 
VKKDEADHD6 
AELYGSDPRE 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



45 



50 



55 
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Seq ID NO: 106 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-474 (underlined sequences correspond to start and stop codons) 



1 
I 

ACAG TACTCT 
GCTAAGGTGG 
CTTGATCCAA 
ATCAGTGTAT 
GTTCGTCTTC 
TTATGCAAAC 
CAGTCCAGAG 
AACGATTTAC 
GAAGGTAAAC 
GAAAGCTGTG 
ATCCTCTCGA 
CTTTACCATA 
CAATCGAATA 
AGCCATTGAA 
AC 



11 
I 

GTGCAAAAAA 
TGGTTGATGG 
AGTGGAATCA 
GGAATCACAA 
TTTCCAATGC 
TCGGGCCAAA 
ACCGAATA6G 
CAGACGGAGC 
ACCCGGTTAA 
GAGTTTTTTG 
GTGCCTCTGC 
CACCAGATTG 
GACATTCCAC 
GAAACATGT6 



21 
I 

CCTGGTGAAA 
ATCTGGGCAA 
GCATTATQAC 
GAAGATCCAT 
CATCAACCGC 
TGACAATGAT 
CACAGGAGGA 
TCATTATTTG 
AACACTGTAC 
ATGAAGAGCG 
AGGGCTTCAA 
ATGCCTGCAC 
CCTATGAAAG 
GATTTGCTGT 



31 
I 

AAGGATTTTT 
TGCCATTCTA 
CTGTATATTG 
AAGAAACAAG 
CTCAAAGACA 
ACAGTTAGAG 
CAAGTTGTGG 
TCGACTTGOA 
ACCAGACAGC 
ACGAGCAAGA 
AGCATTGCAA 
TAACAACCTG 
CTATGAAAAG 
GGAATGACAA 



41 

I 

TCCGACTTCC 
CAGATACTGT 
GAAAGTCTGA 
GTGCTGGATT 
CTGGTTATCA 
GACAGATAGT 
ACT6CAGTC6 
AAGATAGATG 
AACATTGTCA 
TTGCTTCAGT 
GGTGCTGCAG 
CCGAAAGCCC 
CTATATGAAA 
GCTTCAAGGA 



51 
I 

TGATCCATTT 
GAAGAATACG 
TTCAGTTACG 
TCTCGGTTGT 
GAGGTTGGAT 
AGTAA6TCTT 
TTTATTTGAT 
TTAATGACTG 
AATGGTTCTG 
TTGTGACAGG 
GCCCGAGACT 
ACACTTGCTT 
AGCTGCTAAC 
TTTACCCAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



65 



Seq ID No: 107 Protein sequence! 
Protein Accession #: none found 



70 



75 



1 11 21 31 41 51 

< > I I I I 

TVLCAKNLVK KDFPRIjPDPF AKWVDGSGQ CHSTDTVKNT LDPKWNQHYD LYIGKSDSVT 

ISVWNHKKIH KKQGAGFIX3C VRLI.SNAINR LKDTGYQRZiD IiCKLGPNDND TVRGQIWSL 
QSRDRIGTGG QWDCSRXiFO NDXjPDGAHYL WTWKDRC 



60 
120 



Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: NM_002318.1 

Coding sequence: 248-2572 (underlined sequences correspond to start and stop codons) 
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NSDOCID: <WO. 
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PCT/US02/04915 



1 11 21 31 41 51 

I I I 1 I I 

5 ACTCCAGCGC GCGGCTACCT ACGCTTGGTG CTTGCTTTCT CCAGCCATCG GAGACCAGAG 60 

CCGCCCCCTC TGCTCGAGAA AGGGGCTCAG CGGCGGCGGA AGCGGAGGGG GACCACCGTG 120 

GAGAGCGCGG TCCCAGCCCG GCCACTGCGG ATCCCTGAAA CCAAAAAGCT CCTGCTGCTT 180 

CTGTACCCCG CCTGTCCCTC CCAGCTGCGC AGGGCCCCTT CGTGGGATCA TCAGCCCGAA 240 

GACAGGGATG GAGAGGCCTC TGTGCTCCCA CCTCTGCAGC TGCCTGGCTA TGCTGGCCCT 300 

10 CCTGTCCCCC CTGAGCCTGG CACAGTATGA CAGCTGGCCC CATTACCCCG AGTACTTCCA 360 

GCAACCGGCT CCTGAGTATC ACCAGCCCCA GGCCCCCGCC AACGTGGCCA AGATTCAGCT 420 

GCGCCTGGCT GGGCAGAAGA GGAAGCACAG CGAGGGCCGG GTGGAGGTGT ACTAT6ATGG 480 

CCAGTGGGGC ACCGTGTGCG ATGACGACTT CTCCATCCAC GCTGCCCACG TCGTCTGCCG 540 

GGAGCTGGGC TATGTGGAGG CCAAGTCCTG GACTGCCAGC TCCTCCTACG GCAAGGGAGA 600 

15 AGGGCCCATC TGGTTAGACA ATCTCCACTG TACTGGCAAC GAGGCGACCC TTGCAGCATG 660 

CACCTCCAAT GGCTGGGGCG TCACTGACT6 CAAGCACACG GAGGATGTCG GTGTGGTGTG 720 

CAGCGACAAA AGGATTCCT6 GGTTCAAATT TGACAATTCG TTGATCAACC AGATAGAGAA 780 

CCTGAATATC CAGGTGGAGG ACATTCGGAT TCGAGCCATC CTCTCAACCT ACCGCAAGCG 840 

CACCCCA6TG ATGGAGGGCT ACGTGGAGGT GAAGGAGGGC AAGACCTGGA AGCAGATCTG 900 

20 TGACAAGCAC TGGACGGCCA AGAATTCCCG CGTGGTCTGC GGCATGTTTG GCTTCCCTGG 960 

GGAGAGGACA TACAATACCA AAGTGTACAA AATGTTTGCC TCACGGAGGA AGCAGCGCTA 1020 

CTGGCCATTC TCCATGGACT GCACCGGCAC AGAGGCCCAC ATCTCCAGCT GCAAGCTGGG 1080 

CCCCCAGGTG TCACTGGACC CCATGAAGAA TGTCACCTGC GAGAATGGGC TGCCGGCCGT 1140 

GGTGAGTTGT GTGCCTGGGC AGGTCTTCAG CCCTGACGGA CCCTCGAGAT TCCGGAAAGC 1200 

25 ATACAAGCCA GAGCAACCCC TGGrGCGACT GAGAGGCGGT GCCTACATCG GGGAGGGCCG 1260 

CGTGGAGGTG CTCAAAAATG GAGAATGGGG GACCGTCTGC GACGACAAGT GGGACCTGGT 1320 

GTCGGCCAGT GTGGTCTGCA GAGAGCTGGG CTTTGGGAGT GCCAAAGAGG CAGTCACTGG 1380 

CTCCCGACTG GGGCAAGGGA TCGGACCCAT CCACCTC:AAC GAGATCCAGT GCACAGGCAA 144 0 

TGAGAAGTCC ATTATAGACT GCAAGTTCAA TGCCGAGTCT CAGGGCTGCA ACCACGAGGA 1500 

30 GGATGCTGGT GTGAGATGCA ACACCCCTGC CATGGGCTT6 CAGAAGAAGC TGCGCCTGAA 1560 

CGGCGGCCGC AATCCCTACG AGGGCCGAGT GGAGGTGCTG GTGGA6AGAA ACGGGTCCCT 1620 

TGTGTGGGGG ATGGTGTGTG GCCAAAACTG GGGCATCGTG GAGGCCATGG TGGTCTGCCX3 1680 

CCAGCTGGGC CTGGGATTCG CCAGCAACGC CTTCCAGGAG ACCTGGTATT GGCACGGAGA 1740 

TGTCAACAGC AACAAAGTGG TCATGAGTGG AGTGAAGTGC TCGGGAACGG AGCTGTCCCT 1800 

35 GGCGCACTGC CGCCACGACX3 GGGAGGACGT GGCCTGCCCC CAGGGCGGA6 TGCAGTACGG 1860 

GGCCGGA6TT GCCTGCTCAG AAACCGCCCC T6ACCTGGTC CTCAATGCGG AGATGGTGCA 1920 

GCAGACCACC TACCTGGAGG ACCX«3CCCAT GTTCATGCTG CAGTGTGCCA TGGAGGAGAA 1980 

CTGCCTCTCG GCCTCAGCCG CGCAGACCGA CCCCACCAC6 GGCTACCGCC GGCTCCTGCG 2040 

CTTCTCCTCC CAGATCCACA ACAATGGCCA GTCCGACTTC CGGCCCAAGA ACGGCCGCCA 2100 

40 CGCGTGGATC TGGCACGACT GTCACAGGCA CTACCACAGC ATGGAGGTGT TCACCCACTA 2160 

TGACCTGCTG AACCTCAATG GCACCAAGGT GGCAGAGGGC CACAAGGCCA GCTTCTGCTT 2220 

GGAGGACACA GAATGTGAAG GAGACATCCA GAAGAATTAC GAGTGTGCCA ACTTCGGCGA 2280 

TCAGGGCATC ACCATGGGCT GCTGGGACAT GTACCGCCAT GACATCGACT GCCAGTGGGT 2340 

TGACATCACT GACGTGCCCC CTGGAGACTA CCTGTTCCAG GTTGTTATTA ACCCCAACTT 2400 

45 CGAGGTTGCA GAATCCGATT ACTCCAACAA CATCATGAAA TGCAGGAGCC GCTATGACGG 2460 

CCACCGCATC TGGATGTACA ACTGCCACAT AGGTGGTTCC TTCAGCGAAG AGACGGAAAA 2520 

AAAGTTTGAG CACTTCAGCG GGCTCTTAAA CAACCAGCTG TCCCCGCAG^VAAAGAAGCCT 2580 

GCGTGGTCAA CTCCTGTCTT CAGGCCACAC CACATCTTCC ATGGGACTTC CCCCCAACAA 2640 

CTGAGTCTGA ACGAATGCCA CGTGCCCTCA CCCAGCCCGG CCCCCACCCT GTCCAGACCC 2700 

50 CTACAGCTGT GTCTAAGCTC AGGAG6AAAG GGACCCTCCC ATCATTCATG GGGGGCTGCT 2760 

ACCTGACCCT TGGGGCCTGA GAAGGCCTTG GGGGGGTGGG GTTTGTCCAC AQAGCTGCTG 2820 

GAGCAGCACC AAGAGCCA6T CTTGACCGGG ATGAGGCCCA CAGACAGGTT GTCATCAGCT 2880 

TGTCCCATTC AAGCCACC6A GCTCACCACA GACACAGTGG AGCCGCGCTC TTCTCCA6TG 2940 

ACACGTGGAC AAATGCGGGC TCATCAGCCC CCCCAGAGAG GGTCA6GCCG AACCCCATTT 3000 

55 CTCCTCCTCT TAGGTCATTT TCAGCAAACT TGAATATCTA GACCTCTCTT CCAATGAAAC 3060 

CCTCCAGTCT ATTATAGTCA CATAGATAAT GGTGCCACGT GTTTTCTGAT TTGGTGAGCT 3120 

CAGACTTGGT GCTTCCCTCT CCACAACCCC CACCCCTTGT TTTTCAAGAT ACTATTATTA 3180 

TATTTTCACA GACTTTTGAA GCACAAATTT ATTGGCATTT AATATTGGAC ATCTGGGCCC 3240 

TTGGAAGTAC AAATCTAAGG AAAAACCAAC CCACTGTGTA AGTGACTCAT CTTCCTGTTG 3300 

60 TTCCAATTCT GTGGGTTTTT GATTCAACGG TGCTATAACC AGGGTCCTGG GTGACAGGGC 3360 

GCTCACTGAG CACCATGTGT CATCACAGAC ACTTACACAT ACTTGAAACT TGGAATAAAA 3420 
GAAAGATTTA TG 

Seq ID No: 109 Protein sequence; 
65 Protein Accession #; NP_002309.1 



1 11 21 31 41 51 

I I I I I I 

70 MERPLCSHIiC SCIAMIAIiIiS PLSLAQYDSW PHYPEYFQQP APEYHQPQAP ANVAKIQLRL 60 

AGQKRKHSEG RVEVYYDGQfW GTVCDDDFSI HAAHWCREIj GYVEAKSWTA SSSYGKGEGP 120 

IWLDNIiHCTG NBATLAACTS NGWGVTDCKH TEDVGWCSD KRIPGPKFDN SIiINQIENIiN 180 

IQVEDIRIRA IIiSTYRRRTP VMEGYVEVKE GKTWTOICDK HWTAKNSRW CGMFGFPGBR 240 

TYNTKVYKMF ASRRKQRYWP FSMDCTGTEA HISSCKL6PQ VSIiDPMKNVT CENGLPAWS 300 

75 CVPGQVFSPD GPSRPRKAYK PEQPLVRLRG GAYIGEGRVB VIiKNGEWGTV CDDKWDIiVSA 360 

SWCRBLGFG SAKEAVTGSR LGQGIGPIHIi NEIQCT6NEK SIIDCKFNAB SQGCNHEBDA 420 



231 
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GVRCNTPAMG LQKKIiRENGG RNPYEGRVEV LVERNGSLVW GMVCGQNWGI VEAMWCRQL 480 

Gl^FASNAFQ ETWYWHGDVN SNKWMSGVK CSGTELSIiAH CRHDGEDVAC PQGGVQYGAG 54 0 

VACSBTAPDL VLNAEMVQQT TYLEDRPMFM LQCAMEENCL SASAAQTDPT TGYRRLLRFS 600 

SQIHNNGQSD FRPKNGRHAW IWHDCHRHYH SMBVFTHYDL LNLNGTKVAE GHKASFCLED 660 

O TECEGDIQKN YECANFGDQG ITMGCWDMYR HDIDCQWVDI TDVPPGDYLF QWINPNFEV 720 
AESDYSNNIM KCRSRYDGHR IWMYNCHIGG SFSEETBKKF EHFSGLLNNQ LSPQ 

Seq ID NO: 110 DNA sequence 
in Nucleic Acid Accession #: none found, CAT_73007_3 

lU Coding sequence: 1-495 (underlined sequences correspond to start and stop codons) 

] ]^ 21 31 41 51 

' I I I I I 

CGGACGCGTG GGTCGACCCA CGCGTCCGCC CACGCGTC06 TATGGACAGA GCCTCCACTG 60 

ID GCTGCTGCCT GCCCGCCACA TACCCAGCTG ACATGGGCAC CGCAG6AGCC ATGCAGCTGT 120 

CTGGGTGATC CTGGGCTTCC TCCTGTTCCG AGGCCACAAC TCCCAGCCCA CAATGACCCA 180 

ACCTCTAGCT CTCAGGGAGG CCTTGGCGGT CTAAGTCTGA CCACAGAGCC AGTTTCTTCC 240 

ACCCAGGATA CATCCCTTCC TCAGAGGCTA ACAGGCCAAG CCATCTGTCC AGCACTGGTA 300 

CCCAGGCGCA GGTGTCCCCA GCAGTGGAAG AGACGGAGGC ACAAGCAGAG ACACATTTCA 360 

ACTGTTCCCC CCAATTCAAC CACCATGAGC CTGAGCATGA GGGAAGATGC GACCATCCTG 420 

CCAGCCCCAC GTCAGAGACT GTGCTCACTG TGGCTGCATT TGGGATGGAG TCGGGTGGAG 480 

GCCCACTCTG GCTAGGGGGC GGCAGGCTGA GAGCTCACCT GTTCAGCAGA GAAGTGGAAC 540 

CACTTTGCTC CTGGAGCCTG TCTACCACAG TGTTATCAGC TTCATTGTCA TCCTGGTGGT 600 

GTGGTGATCA TCCTAGTTGG TGTGGTCAGC CTGAGGGTTC AGTGTCGGAA GAGCAAGGAG 660 

ZD TCTGAAGATC CCAGAACCTG GGAGTACAGG GCGTGTCTGA CAAGCTGGTC ACAGACCATG 720 

GCGAGAACGA CAGCATCGCC CATTATCACA TGGAAGACAT CACACGACTT AGGGCAACAC 780 

GCACTCAGCA GC6AGCATCA AAGGAGCCTA CGCATGGCCC AGACTGAGAG CAAGCACAAA 840 
GG6C 

30 Seq ID No: 111 Protein gecruence ; 

Protein Accession #; none found, CAT 73007 3 
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40 
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50 
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60 



65 



70 



75 



1 11 21 31 41 51 

' I I I I I 

RTRGSTHASA HASVWTEPPL AAACPPHTQL TWAPQEPCSC LGDPGLPPVP RPQLPAHNDP 
TSSSQGGLGG LSLTTEPVSS TQDTSLPQRI, TGQAICPAIiV PRRRCPQQWK RRRHKQRHIS 
TVPPNSTTMS LSMRBDATIL PAPRQRIjCSI* WIiHLGWSRVE AHSG 



60 
120 



Seq ID NO: 112 DNA secmence 

Nucleic Acid Accession #: NM_005424.l 

Coding sequence: 37-3453 (underlined sequences correspond to start and stop codons) 



1 
I 

CGCTCGTCCT 

TTGCTCCCCA 

GCCAACCTGC 

GGGGCGGGGA 

ATCGTGCGCA 

ACGCTTCGCG 

GCTGGGGCGC 

CCAGACAAGG 

CACAAGGAGA 

GACTGGCATG 

TCGAGCGGCA 

CGGCTCATCG 

CCAGGTTGCC 

GGCTTCACTG 

CAG6AGCAGT 

TATGGCTGCT 

GGTCATTTTG 

CGGTTCAGTG 

CGGATCCCCC 

CX3GATCAACT 

AAGCCAGACG 

GCTGAGTTCG 

TCCACATCTG 

CCCCTGGCTG 

GTCTCGTTCT 

AGTACCATGG 

CTGAGGCCAA 

GAGGGGGCCT 

CCGTGGTTGG 

CCCTTGGTGC 



11 

I 

GGCTGGCCT6 
TCCTCTTCTT 
GGCTCACGGA 
GGGGCTCG6A 
CCCCGCCCGG 
GCTTCTCCAA 
GGCGCACGCG 
TCACACACAC 
AGCAGACAGA 
AAGCCCAGGA 
TCTACAGTGC 
TGCGGGGTTG 
TACATGGAGG 
GCACCCGCTG 
GCCCAGGCAT 
CTTGTGGATC 
GGGCTGATTG 
GTTGTGTCTG 
AGATCCTCAA 
GTGCA6CTGC 
GCACTGTGCT 
AGGTGCCCCG 
GCGGCCAAGA 
CACCTCGGCT 
CTGGGGATGG 
ACTGGTCGAC 
AGACAGGATA 
GGGGGCCTCC 
AGGGCTGGCA 
CCGGGCCACT 



21 
I 

GGTCGGCCTC 
GGCTTCTCAT 
CCCCCAGC6C 
CCSCCTGGGGC 
GCCACCCCTG 
GCCCTCGGAC 
CGTCATCTAC 
TGTGAACAAA 
CGTGATCTGG 
TGGGCGGTTC 
CACTTACCTG 
TGGGGCTGGG 
TGTCTGCCAC 
TGAACAGGCC 
ATCAGGCTGC 
TGGCTGGAGA 
CCGACTCCAG 
CCCCTCTGGG 
CATGGCCTCA 
AGGGAACCCC 
CCTGTCCACC 
CTTGGTTCTT 
CAGCCGGCGC 
CCTGACCAAG 
ACCCATCTCC 
CATTGTGGTG 
CAGTGTTCGT 
CACCCTCATG 
TGTGGAA(3GC 
GGTGGGCGAC 



31 
I 

TGGAG TATGG 
GTGGGCGCGG 
TTCTTCCTGA 
CCGCCCCTGC 
CGCCTGCSCGC 
CTCGTGGGCG 
GTGCACAACA 
GGTGACACCG 
AAGAGCAACG 
CTGCTGCAGC 
GAAGCCAGCC 
CGCTGGGGGC 
GACCATGACG 
TGCAGAGAGG 
CGGGGCCTCA 
GGAAGCCAGT 
TGCCAGTGTC 
TGGGATGGAG 
GAACTGGAGT 
TTCCCCGTGC 
AA66CCATTG 
GCGGACAGTG 
TTCAAGGTCA 
CAGAGCCGCC 
ACTGTCCGCC 
GACCCCAGTG 
GTGCAGCTQA 
ACCACACSACT 
ACTGACCGGC 
GGTTTCCTGC 



41 

I 

TCTGGCGGGT 
CGGTGGACCT 
CTTGCGTGTC 
T6CT6GAGAA 
GCAACG6TTC 
TCTTCTCCTG 
GCCCTGGAGC 
CTGTACTTTC 
GATCCTACTT 
TCCCAAATGT 
CCCTGGGCAG 
CAGGCTGTAC 
GCGAATGTGT 
GCCGTTTTGG 
CCTTCTGCCT 
GCCAAGAAGC 
AGAATGGTGG 
TGCACTGTGA 
TCAACTTAGA 
GGGGCAGCAT 
TGGAGCCAGA 
GGTTCTGGGA 
ATGTGAAA6T 
AGCTTGTC^GT 
TGCACTACCG 
AGAAC6TGAC 
GCCG6CCAGG 
GTCCTGAGCC 
TGGGAGTGAG 
TGCX3CCTGTG 



51 

I 

GCCCCCTTTC 

GACGCTGCTG 

TGGGGAGGCC 

G<3ACGACCGT 

6CACCAGGTC 

CGTGGGCGGT 

CCACCTGCTT 

TGCACGTGTG 

CTACACCCTG 

GCAGCCACCA 

CGCCTTCTTT 

CAAGGAGTGC 

ATGCCCCCCT 

GCAGAGCTGC 

CCCAGACCCC 

TTGTGCCCCT 

CACTTGTGAC 

GAAGTCAGAC 

GACGATGCCC 

AGAGCTACGC 

GAAGACCACA 

GTGCCGTGTG 

GCCCCCCGTG 

CTCCCCGCTG 

GCCCCAGGAC 

GTTAATGAAC 

GGAAG6AGGA 

TTTGTTGCA6 

CTGGTCCTTG 

GGACGGGACA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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960 
1020 
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1140 
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1620 
1680 
1740 
1800 
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CGGGGGCAGG AGCGGCGGGA GAACGTCTCA TCCCCCCAGG CCCGCACTGC CCTCCTGACG 1860 

GGACTCACGC CTGGCACCCA CTACCAGCTG GATGTGCAGC TCTACCACTG CACCCTCCTG 1920 

GGCCCGGCCT CGCCCCCTGC ACACGTGCTT CTGCCCCCCA GTGGGCCTCC AGCCCCCCGA 1980 

CACCTCCACG CCCAGGCCCT CTCAGACTCC GAGATCCAGC TGACATGGAA GCACCCGGAG 204 0 

GCTCTGCCTG GGCCAATATC CAAGTACGTT GTGGAGGTGC AGGTGGCTGG GGGTGCAGGA 2100 

GACCCACTGT GGATAGACGT GGACAGGCCT GAGGAGACAA GCACCATCAT CCGTGGCCTC 2160 

AACGCCAGCA CGCGCTACCT CTTCCGCATG CGGGCCAGCA TTCAGGGGCT CGGGGACTGG 2220 

AGCAACACAG TAGAAGAGTC CACCCTGGGC AACGGGCTGC AGGCTGAGGG CCCAGTCCAA 2280 

GAGAGCCGGG CAGCTGAAGA GGGCCTGGAT CAGCAGCTGA TCCTGGCGGT GGTGGGCTCC 2340 

GTGTCTGCCA CCTGCCTCAC CATCCTGGCC GCCCTTTTAA CCCTGGTGTG CATCCGCAGA 24 00 

AGCTGCCTGC ATCGGAGACG CACCTTCACC TACCAGTCAG GCTCGGGCGA GGAGACCATC 2460 

CTGCAGTTCA GCTCAGGGAC CTTGACACTT ACCCGGCGGC CAAAACTGCA GCCCGAGCCC 2520 

CTGAGCTACC CAGTGCTAGA GTGGGAGGAC ATCACCTTTG AGGACCTCAT CGGGGAGGGG 2580 

AACTTCGGCC AGGTCATCCG GGCCATGATC AAGAAGGACG GGCT6AAGAT GAACGCAGCC 2640 

ATCAAAATGC TGAAAGAGTA TGCCTCTGAA AATGACCATC GTGACTTTGC GGGAGAACTG 2700 

GAAGTTCTGT GCAAATTGGG GCATCACCCC 7ACATCATCA ACCTCCTGGG GGCCTGTAAG 2760 

AACCGAGGTT ACTTGTATAT CGCTATTGAA TAT6CCCCCT ACGGGAACCT GCTAGATTTT 2820 

CTGCGGAAAA GCCGGGTCCT AGAGACTGAC CCAGCTTTTG CTCGAGAGCA TGGGACAGCC 2880 

TCTACCCTTA GCTCCCGGCA GCTGCTGCGT TTCGCCAGTG ATGCX3GCCAA TGGCATGCAG 2940 

TACCTGAGTG AGAAGCAGTT CATCCACAGG GACCTGGCTG CCCG6AATGT GCTGGTCGGA 3000 

GAGAACCTAG CCTCCAAGAT TGCAGACTTC GGCCTTTCTC GGGGAGAGQA GGTTTATGTG 3060 

AAGAAGACGA TGGGGCGTCT CCCTGTGCGC TGGATGGCCA TTGAGTCCCT GAACTACAGT 3120 

GTCTATACCA CCAAGAGTGA TGTCTGGTCC TTTGGAGTCC TTCTTTGGGA GATAGTGAGC 3180 

CTTGGAGGTA CACCCTACTG TGGCATGACC TGTGCCGAGC TCTATGAAAA GCTGCCCCAG 324 0 

GGCTACCGCA TGGAGCAGCC TCGAAACTGT GACGATGAAG TGTACGAGCT GATGCGTCAG 3300 

TGCTGGCGGG ACCGTCCCTA TGAGCGACCC CCCTTTGCCC AGATTGCGCT ACAGCTAGGC 3360 

CGCATGCTGG AAGCCAGGAA GGCCTATGTG AACATGTCGC TGTTTGAGAA CTTCACTTAC 3420 

GCGGGCATTG ATGCCACAGC TGAGGAGGCC TGAGCTGCCA TCCAGCCAGA ACGTGGCTCT 34 80 

GCTGGCCGGA GCAAACTCTG CTGTCTAACC TGTGACCAGT CTGACCCTTA CAGCCTCTGA 354 0 

CTTAAGCTGC CTCAAGGAAT TTTTTTAACT TAAGGGAGAA AAAAAGGGAT CTGGGGATGG 3600 

GGTGGGCTTA GGGGAACTGG GTTCCCATGC TTTGTAGGTG TCTCATAGCT ATCCTGGGCA 3660 

TCCTTCTTTC TAGTTCAGCT GCCCCACAGG TGTGTTTCCC ATCCCACTGC TCCCCCAACA 3720 

CAAACCCCCA CTCCAGCTCC TTCGCTTAAG CCAGCACTCA CACCACTAAC ATGCCCTGTT 3780 

CAGCTACTCC CACTCCCGGC CTGTCATTCA GAAAAAAATA AATGTTCTAA TAAGCTCCAA 3840 
AAAAA 



Seq ID No: 113 Protein sequence: 
Protein Accession #: MP_005415.1 



MVWRVPPFLI* 
LLLEKDDRIV 
NSPGAHLIiPD 
QLPNVQPPSS 
DGECVCPPGF 
QCQEACAPGH 
EFNIiETMPRI 
SGFWECRVST 
RLJnTRPQDST 
DCPEPLLQPW 
QARTALLTGL 
QLTWKHPEAIi 
SIQGIiGDWSN 
LTIiVCIRRSC 
FEDIiIGEGNF 
XNIjIiGACKNR 
SDAANGMQYIj 
AIESLNYSVY 
EVYEIiMRQCW 



11 
I 

PILFItASHVG 
RTPPGPPLRL 
KVTHTVNKGD 
GIYSATYLEA 
TGTRCEQACR 
FGADCRIiQCQ 
NCAAAGNPFP 
SGGQDSRRFK 
MDWSTIWDP 
IjEGWHVEGTD 
TPGTHYQIoDV 
PGPISKYWE 
TVBESTIjGNG 
mRRRTFTYQ 
GQVXRAMZKK 
GYLYIAIEYA 
SEKQFIHRDL 
TTKSDVWSFG 
RDRPYERPPF 



21 
I 

AAVDLTLIiAN 
ARNGSHQVTL 
TAVLSARVHK 
SPLGSAFFRL 
EGRFGQSCQE 
CQNGGTCDRF 
VRGSIELRKP 
VNVKVPPVPIi 
SENVTLMNIiR 
RliRVSWSLPL 
QLYHCTUJGP 
VQVAGGAGDP 
LQAEGFVQES 
SGSGEETILQ 
DGIiKMNAAXK 
PYGNLLDFLR 
AARNVLVGEN 
VIOtWEIVSIiG 
AQIALQIiGRM 



31 
I 

LRIjTDPQRFF 
RGFSKPSDIiV 
EKQTDVIWKS 
IVRGCGAGRW 
QCPGISGCRG 
SGCVCPSGWH 
DGTVIjIiSTKA 
AAPRIiIiTKQS 
PKTGYSVRVQ 
VPGPLVGDGF 
ASPPAHVLIiP 
LWIDVDRPEE 
RAAEEGIiDQQ 
FSSGTIiTIiTR 
MLKEYASEND 
KSRVIiETDPA 
liASKXADFGL 
GTPYCGMTCA 
IjEARKAYVNM 



41 

I 

LTCVSGEAGA 
GVFSCVGGAG 
NGSYFYTLDW 
GPGCTKECPG 
IjTFCLPDPYG 
GVHCEKSDRI 
IVEPEKTTAE 
RQLWSPIiVS 
LSRPGEGGEG 
IjIjRLWDGTRG 
PSGPPAPRHL 
TSTIlRGItNA 
IiXIiAWGSVS 
RPKIiQPEPIiS 
HRDFAGELEV 
FAREHGTAST 
SRGEEVYVKK 
ELYEKIiPQGY 
SLFENFTYAG 



51 

I 

GRGSDAWGPP 
ARRTRVIYVH 
HEAQDGRFLL 
CliHGGVCHDH 
CSCGSGWRGS 
PQILNMASEIi 
FEVPRIjVIjAD 
FSGDGPISTV 
AWGPPTIiMTT 
QBRRENVSSP 
HAQAIiSDSEI 
STRYTjFRMRA 
ATCLTXLAAL 
YPVLEWEDXT 
JjCKLGHHPNX 
LSSRQLLRFA 
TMGRIiPVRWM 
RMEQPRNCDD 
IDATAEBA 
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Seq ID NO: 114 DNA secruence 

Nucleic Acid Accession #: NM_002632.1 

Coding sequence: 322-771 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGGATTCGGG CCGCCCAGCT ACGGGAGGAC CTGGAGTGGC ACTGGGCGCC C6ACGGACCA 60 

TCCCCGGGAC CCGCCTGCCC CTCGGCGCCC CGCCCCGCCX5 GGCCGCTCCC CGTCGGGTTC 120 

CCCAGCCACA GCCTTACCTA CGGGCTCCTG ACTCCGCAAG GCTTCCAGAA GATGCTCGAA 180 

CCACCGGCCG GGGCCTCGGG GCAGCAGTGA GGGAG6CGTC CA GCC CCCCA CTCAGCTCTT 240 

CTCCTCCTGT GCCAGGGGCT CCCCGGGG6A TGAGCATG6T GGTTTTCCCT CGGAGCCXCC 300 

TGGCTCGGGA CGTCTGAGAA GATGCCGGTC ATGAGGCTGT TCCCTTGCTT CCTGCAGCTC 360 

CTGGCCGGGC TGGCGCTGCC TGCTGTGCCC CCCCAGCAGT GGGCCTTGTC TGCTGGGAAC 420 

GGCTCGTCAG AGGTGGAAGT GGTACCCTTC CAGGAAGTGT GGGGCCGCAG CTACTGCCGQ 480 
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10 



15 



20 



GCGCTGGAGA GGCTGGTGGA CGTCGTGTCC GAGTACCCCA GCGAGGTGGA GCACATGTTC 
AGCCCATCCT GTGTCTCCCT GCTGCGCTGC ACCGGCTGCT GCGGCGATGA GAATCTGCAC 
TGTGTGCCGG TGGAGACX3GC CAATGTCACC ATGCAGCTCC TAAAGATCCG TTCTGGGGAC 
CGGCCCTCCT ACGTGGAGCT GACGTTCTCT CAGCACGTTC GCTGCGAATG CCGGCCTCTG 
CG6GAGAAGA TGAAGCCGGA AAGGTGCGGC GATGCTGTTC CCCGQAGGTA^ACCCACCCCT 
TGGAGGAGAG AGACCCCGCA CCCGGCTCGT GTATTTATTA CCGTCACACT CTTCAGTGAC 
TCCTGCTGGT ACCTGCCCTC TATTTATTAG CCAACTGTTT CCCTGCTGAA TGCCTCGCTC 
CCTTCAA6AC GAGGGGCAGG GAAGGACAGG ACCCTCAGGA ATTCAGTGCC TTCAACAACG 
TGAGAGAAAG AGAGAAGCCA GCCACAGACX: CCTGGGAGCT TCCGCTTTGA AAGAAGCAAG 
ACACGTGGCC TCGTGAGGGG CAAGCTAGGC CCCA6AGGCC CTGGAGGTCT CCAGGGGCCT 
GCAGAAGGAA AGAAGGGGGC CCTGCTACCT GTTCTTGGGC CTCAGGCTCT GCACAGACAA 
GCAGCCCTTG CTTTCGGAGC TCCTGTCCAA AGTAGGGATG CGGATTCTGC TGGGGCCGCC 
ACGGCCTGGT GGTGGGAAGG CCGGCAGCGG GCGGAGGGGA TTCAGCCACT TCCCCCTCTT 
CTTCTGAAGA TCAGAACATT CAGCTCTGGA GAACAGTGGT TGCCTGGGGG CTTTTGCCAC 
TCCTTGTCCC CCGTGATCTC CCGTCACACT TTGCCATTTG CTTGTACTGG GACATTGTTC 
TTTCCGGCCG AGGTGCCACC ACCCTGCCCC CACTAAGAGA CACATACAGA GTGGGCCCCG 
GGCTGGAGAA AGAGCTGCCT GGATGAGAAA CAGCTCAGCC AGTGGGGATG AGGTCACCA6 
GGGAGGAGCC TGTGCGTCCC AGCTGAAGGC AGTGGCAGGG GAGCAGGTTC CCCAAGGGCC 
CTGGCACCCC CACAAGCTGT CCCTGCAGGG CCATCTGACT GCCAAGCCAG ATTCTCTT6A 
ATAAAGTATT CTAGTGTGGA AACGC , 
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Seq ID No: 115 Protein seemence ; 
Protein Accession #: NP 002623.1 
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MPVMRLPPCF LQLIiAGLALP AVPPQQWALS AGNGSSBVEV VPPQEVWGRS YCRALERLVD 
WSEYPSEVE HMFSPSCVSL LRCTGCCGDE NLHCVPVETA NVTMQLLKIR SGDRPSYVEL 
TFSQHVRCEC RPLREKMKPE RCGDAVPRR 



60 
120 



Seq ID NO: 116 DMA aemieneg 

Nucleic Acid Accession #: NM_00736X.l 

coding sequence: 1-4131 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGGAGGGGG 
CTGCAGTTGC 
GAGTCGTGGT 
GCTGGCGAAT 
ACGGCATCAT 
CCACCGACTT 
GCCGAGTCCT 
TGCGCGCTGG 
CTGGGAGCAG 
AACACTTTCC 
CCTGCCAACG 
CAGCTTCCAG 
CCATATTTCA 
CTGGGGATCC 
AGGCCAGCTG 
TTCAGCCATG 
GTGAATGAGG 
AGCAGCATTG 
ACCTTGGATC 
AAAGGCCAAG 
GACAGAGATT 
ATCCAGCCCT 
CCTGAAGAAG 
CGAGGGACGT 
TATAATGCTG 
TTCTGCACGG 
GGGAAGCACT 
CTCCACGTGG 
GGCAATGATG 
CTCCTCCCCC 
GGCTCTGAGA 
TTCTACCCGG 
AACTACCTGA 
ACAGCCCACA 
ACAAGTTCCA 
ATCCACCAGA 
ACCACCCAGC 
CTTAGATTTG 
GTGAATCCTT 
ACAGGTGTAG 



11 
I 

ACCGGGTGGC 
TAATGTTGCG 
GGGACCAGCT 
CCCCTGCACT 
CTCCACTCAG 
CCCGGCCATC 
GTACCGAGAG 
CTTCCCGCGC 
GTAGGCGCTT 
AGGCAGTTTT 
GCCTGCAGTT 
CTCGGGTGGG 
GCTTGACTAG 
CTGGAGTGTG 
CAGTTGGAGA 
CTACAGCCCT 
AGGAAGCTGA 
ATGTTTCCTT 
CTCACACCAA 
TTGAGCCCTG 
CACTGGCTCC 
ACCCAGATGG 
AAATTGTTCT 
ATGAGGTGGG 
CCAACAAGGA 
ACTATGCCAC 
GTCTGCCTGA 
GCCATACACC 
6CAGA6CCTA 
TCACACCAAT 
ACGGCTTCAG 
GA6AGGA6AC 
GCATTAAGAC 
TCTCTCCCTA 
GAGACTACTC 
ACATCACTTA 
AGCTGAACGT 
CTGTGACCAA 
GCTATGATGG 
ATTACACCTG 



21 
I 

CGGGCGGCCG 
GGCCGCGGCG 
CCTGCAGGAA 
TCTTACGAAG 
GACTTCCCCA 
GCCCCTTTTC 
GACACCTCCC 
TCTGCGCGCT 
ACGAGGAGGT 
GGCATCTGAT 
CCTTGGAACC 
CTTCTGCCGA 
CACTGAACA6 
GQCTTTCCAT 
CCTTTCCGCT 
GGAAAGTGAC 
ATACCTTCCG 
CCAATCCTU^ 
AGAAGGAACA 
GGATGAGAGA 
TTCCTGGGAA 
AGGGCCAGTG 
TCGAAGTTAC 
ACTGGAAGAC 
AACCTGTGAA 
TGGCTTCTGC 
GGGGGCACCT 
CGTGCACTTC 
CAGGGCCATC 
TGGAGGCCTG 
CCTCGCAGGT 
GGTTCGTATC 
CAACATTCAA 
CAAGGAGCTG 
TCTGACTTTT 
CCAGGTGTGC 
GGACCGGGTC 
TCAAATTGGC 
GAGCCACATG 
TGAGT6CGCA 



31 
I 

GTGCTGTCGT 
CTGCACCCAG 
GGCGACGACG 
CCCGATTCAG 
GGGAAACGCA 
TGGCGGACAT 
CCGCAGTGCT 
TTTTACCCCC 
CAAACGCGGG 
GGGTCTGATA 
CGCCCCAAAG 
GGGGAGGCTG 
TCTGTGAAAA 
ATCGGCAGCA 
GCCCACTCTT 
TATAATGAG6 
GGTGAACCAG 
GTGGATACAA 
TCTCTGGGAG 
GAGACCAGAA 
ACCCCACCAC 
CCTTCGGAAA 
CCTGCTTCAG 
AACATAGGTT 
CACAACCACA 
TGCCACTGCC 
CACCGAGTGA 
ACTGATGTGG 
A6CCACATCC 
TTTGGCTGGC 
GCTGCCTTTA 
ACTCAAACTG 
GGCCAGGT6C 
TACCACTACT 
GGTGCAATCA 
A6GCACGCCC 
TTTGCCTTGT 
CCGGTCAAAG 
TGTGACACAA 
TCTGGGTACC 



41 
I 

CGTTACCAGT 
ACGAGCTCTT 
TAAAGCTCAG 
CAACCTCTAC 
GTATGTGGAC 
CGACACGAGC 
GGGCCTGGCC 
ACCCACGCCT 
CGCTGCCCTC 
GCTACGCCCT 
AGTCTTACAA 
ATGATCTGAA 
ATCTCTATCA 
CTTCCCCGTT 
CT6TTCCCCT 
ACAATTTGGA 
AG6AGGCATT 
AGCCTTTAGA 
AGGTAGGGGG 
GCCCAGCTCC 
CGTACCCCGA 
TGGATGTTCC 
GTCACACTAC 
CCAACACCGA 
GACAATGCTC 
AATCCAAGTT 
ATGGGAAAGT 
ACCTGCATGC 
CACAGCCAGC 
TCTTTGCTTT 
CCCATGACAT 
CTGAGGGACT 
CTTACGTCCC 
CCGACTCCAC 
ACCAAACATG 
CCAGACACCC 
ATAATGAT6A 
AAGATTCAGA 
CAGCACG6TG 
A6GGAGATGG 



51 
I 

GCTACT6CTG 
CCCACACGGG 
CCGTGGT6AA 
GTGGGCACCA 
TATGATTTCC 
CACGGCAGAG 
GCCCGCTATG 
TCCTGGCCAC 
GGGAGAGCTG 
CTTTCTTTAT 
TGTCCAGCTT 
GTCAGAAGGA 
ACTAAGCAAC 
GGACAATGTC 
GGGACGTTCC 
TTACTACGAT 
6AATGGCCAC 
GGAATCTTCC 
CCCAGATTTA 
ACCAGA6GTA 
AAACGGAAGC 
CCCAGCTCAT 
ACCCTTTIAGT 
GGTCTTCACG 
CCGGCATGCC 
TTATGGAAAT 
GAGTGGCCAC 
GTATATCGTG 
AGCCCAGGCC 
AGAAAAACCT 
GGAAGTTACA 
TGACCCAGAG 
AGCAAATTTC 
TGTGACCTCT 
GTCCTACCGC 
GTCCTTCCCC 
AGAAA6AGT6 
CCCCACTCCG 
CCATCCAGGG 
ACGGAACTGT 



60 
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GTGGATGAAA 
AACTTGCCTG 
CATACTTGCA 
GCTCCTGCTG 
CTGCCTGGTT 
AGATGTCACC 
CCCGGATATT 
TGTGAACAAC 
ATCCCCCAAT 
TTCTGCTGGT 
TCCACCCCGC 
GAGCGCTGGA 
GTGCCCCAGT 
TTCTGCTGGT 
ACCACCCCTG 
GATGTGACCC 
TACTTACCCC 
CATG6CTCCA 
GATGTTGCTG 
ATCGTGAATT 
ATGTACTGGA 
CGCAAGGTCC 
CGAGGCAACT 
TTAGATGGAG 
ACCTTTGACC 
TGTACACTAC 
AGCATCGTAA 
TCAGTAAATA 
CTCTACGGGA 
TGTAAAGGAA 
AGGCAAAGAA 
TTTTGTAGTG 
TTAAAAATGA 
TTAAAGCAAC 
AAGAATTAGA 
ATTTU^GCAC 
AATGCAGAAC 
TTTTTACCTC 
TGCATTTTCT 
TTATAAATAC 
TAAGTTTAAA 



ATGAATGTGC 
GAAGCTACAG 
TCTTGATCAC 
GGCAGGCCCG 
ATGCCGGCGA 
CTGCAGCTAC 
ATGGGGATGG 
AGCAGCGCCA 
GCGACGAGCA 
GCGTGGACCC 
CTCACTGTGG 
GGGAAAACCT 
GCGATGACCT 
GTGTGGACAA 
CGTGTATACC 
CTCCATCTGT 
TCAATGGCAC 
TAATCGTGGG 
GACGGACAAT 
CAGGTCTGAT 
CGGACAGTGT 
TCTTCTACAC 
TGTACTGGAC 
AAAACAGAAG 
CTTTCTCTAA 
CTGATGGAAC 
GCTATGCAGA 
AACATAGTGG 
TAACTGCAGT 
GACTTGGAGT 
AGTAAAAAAG 
CAAAAAGACT 
AGGTTGTTAT 
TTCTTGTAAA 
GTTTAAGACT 
ATGTGATCAA 
TGATGGGAAA 
TGCATCAGTG 
GATCCTGTAG 
CAAAGATGTA 
AAAAAAAAAA 



AACTGGCTTT 
GTGTGAGTGC 
CCCACCTGCC 
GTGTGTTCAC 
TGGGCACCAG 
CTGCTACAAT 
ATTTCAGTGC 
TGCCCAGGCC 
GGGCAACTTC 
TGATGGTCAT 
ACCATCACCA 
GCTGGAGCAC 
GGGCCACTTC 
AGATGGCAGA 
CACCGTCGCT 
G66CACCTTC 
CAGGCTTCAG 
AATTGATTAC 
CAGCCGTGCC 
AAGCCCTGAA 
CCTGGATAAG 
AGATCTGGTG 
AGACTGGAAT 
AATTCTGATC 
ACTGCTCTGC 
TGGACGGCGT 
TCACTTCTAC 
CCAGTTTACT 
CTACCCCTAC 
TTACAATCAG 
GAATTGGCCA 
TTTGTGAAAA 
TGCAAGTTTA 
CATTTATCAT 
CTAAACCTGA 
TATAACAATA 
G6ACCAATTA 
CATTTCTATT 
TGCCTCTATA 
ACAATTTTAA 
AAAAAAAAA 



CATCGCTGTG 

CGGAGTGGTT 

AACCCCTGTG 

CATGGAGGCA 

TGCACTGATG 

ACTCCTGGTT 

ATACCTGACT 

CAGTATGCCT 

CTGCCCCTAC 

GAAGTTCCTG 

GAGCCCACCC 

TACGGTGGCA 

ATCCCCCTGC 

GAGGTGCAGG 

CCACCCATGG 

CTGCTCTATA 

AAGGATGCAG 

GACT6CCGGG 

GGTCTGGAAC 

GGACTTGCCA 

ATAGAGAGCG 

AATCCCCGTG 

AGAGAAGCTC 

AATACAGACA 

TGGGCAGATG 

GTCATTCAAA 

CACACAGACT 

GATGAGTATC 

TGCCCAACAG 

AACCTGGACC 

TTAGACGTTC 

GCTGATACCT 

AAAAGGTAAC 

TAATATTTAA 

TTTTTGCCAT 

TAATCCTAAA 

TTTATAGTTT 

TATATCAAAA 

GAAGTACCCA 

AATTTTCTAG 



GCCCCAACTC 
ATGAGTTTGC 
AGGATGGCAG 
GCACGTTCAG 
TAGATGAATG 
CCTTCTCCTG 
CCACCTCAAG 
ACCCTGGGGC 
AGTGTCATGG 
GTACCCAGAC 
AGAGGCCCCC 
CCCCCCGAGA 
AGTGCCACGG 
GCACCCGCTC 
TCCGGCCCAC 
CTCAGGGCCA 
CTAAGACCCT 
AGAGGATGGT 
TGGGAGCAGA 
TAGACCACAT 
CCCTGCTGGA 
CCATCGCTGT 
CTAAAATTGA 
TTGGATTGCC 
CAGGAACCAA 
ACAACCTCAA 
GGAGGAGGGA 
TCCCAGAACA 
GAAGAAAGTA, 
CTAAAGAACA 
CTGAGCATCC 
CAATCTTTAC 
AGAATTTTAA 
AAGATCAAAT 
GGATTCCTTC 
CCTTGACAGT 
CCCAACAAAA 
GGTGCTAAAA 
CAGAAA6TAA 
ATTACTCCAA 



TGTATGTATC 
AGATGACCGG 
TCATACCTGT 
CTGTGCCTGC 
CTCAGAAAAC 
CCGTTGTCAA 
CCTGACACCC 
CCGGTTCCAC 
CAGCACTGGT 
TCCACCTGGC 
GACCATCTGT 
TGACCAGTAC 
AAAGAGCGAC 
CCAGCCAGGC 
GCCCCGGCCA 
GCAGATTGGC 
GCTGTCTCTG 
GTACTGGACA 
6CCTGAGACG 
CCGCAGAACA 
TGGCTCTGAG 
GGATCCAATC 
AACGTCATCT 
CAATGGCTTA 
AAAACTGGAG 
GTACCCCTTC 
TGGTGTTGTA 
ACGATCTCAC 
_AGTACAGTAA 
GTGACTGCAA 
AAGATGAACA 
TACTGTATTT 
CTGTTGCTTA 
TCATTCAACT 
TG6CCAAGAA 
TGGAGAAGCC 
GTTCTAAGAT 
TGATTCAATT 
AGTATCACAT 
TAAA6TGTTT 



2460 

2520 

25B0 

2640 

2700 

2760 

2820 

2B80 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 



Seq ID No: 117 Protein secmence; 
Protein Accession #: NP_031387.1 

1 11 21 31 41 51 

I I 1^1 I I 

MEGDRVAGRP VLSSLPVLLL LQIjIjMLRAAA LHPDEIiFPHG ESWWDQLLQB GDDVKLSRGE 60 

AGESPALLTK PDSATSTWAP TASSPIiRTSP GKRSMWTMIS PPTSRPSPliF WRTSTRATAB 120 

AESCTERTPP PQCWAWPPAM CAIiASRAIiRA FYPHPRIiPGH LGAGRRIiRGG QTRALPSGEL 180 

NTFQAVIASD GSDSYALFIiY PANGLQPLGT RPKESYNVQL QLPARVGFCR GEADDLKSEG 240 

PYFSLTSTEQ SVKNLYQLSN LGIPGVWAFH IGSTSPI«DNV RPAAVGDLSA AHSSVPLGRS 300 

FSHATALESD YNEDNLDYYD VNEEBAEYLP GBPBEAIiNGH SSIDVSFQSK VDTKPIjEESS 360 

TLDPHTKEGT SLGEVGGPDIi KGQVEPWDER ETRSPAPPEV DRDSI.APSWE TPPPYPENGS 420 

IQPYPDGGPV PSEMDVPPAH PEEEIVLRSY PASGHTTPLS RGTYEVGLED NIGSNTEVFT 480 

YNAANKETCE HNHRQCSRHA FCTDYATGFC CHCQSKFYGN GKHCLPEGAP HRVNGKVSGH 540 

3JHVGHTPVHF TDVDLHAYIV GNDGRAYTAI SHIPQPAAQA LLPIiTPIGGL FGWLFALEKP 600 

GSENGFSLAG AAFTHDMEVT FYPGEETVRI TQTAEGIiDPE NYLSIKTNIQ GQVPYVPANF 660 

TAHISPYKEt, YHYSDSTVTS TSSRDYSLTF GAINQTWSYR IHQNITYQVC RHAPRHPSFP 720 

TTQQLNVDRV FAIiYNDEERV LRFAVTNQIG PVKEDSDPTP VNPCYDGSHM CDTTARCHPG 780 

TGVDYTCECA SGYQGDGRNC VDENECATGF HRCGPNSVCI NLPGSYRCEC RSGYEFADDR 840 

HTCILITPPA NPCEDGSHTC APAGQARCVH HGGSTFSCAC LPGYAGDGHQ CTDVDECSEN 900 

RCHPAATCYN TPGSFSCRCQ PGYYGDGFQC IPDSTSSLTP CEQQQRHAQA QYAYPGARFH 960 

IPQCDEQGNF LPLQCHGSTG FCWCVDPDGH EVPGTQTPPG STPPHCGPSP EPTQRPPTIC 1020 

ERWRENLLEH YGGTPRDDQY VPQCDDLGHF IPLQCHGKSD FCWCVDKDGR BVQGTRSQPG 1080 

TTPACIPTVA PPMVRPTPRP DVTPPSVGTF IjIjYTQGQQIG YLPLNGTRLQ KDAAKTLLSL 1140 

HGSIIVGIDY DCRERMVYWT DVAGRTISRA GliEIiGAEPET IVNSGLISPE GLAIDHIRRT 1200 

MYWTDSVliDK lESALLDGSE RKYLFYTDIiV NPRAIAVDPI RGNIiYWTDWN REAPKIBTSS 1260 

LDGENRRILI NTDIGLPNGI, TFDPPSKLLC WADAGTKKLB CTLPD6TGRR VIQNNLKYPF 1320 
SIVSYADHFY HTDWRRDGW SVNKHSGQPT DEYLPBQRSH LYGITAVYPY CPTGRK 

Seq ID HO: 118 DMA sequence 

Nucleic Acid Accession #: NM_003088.1 

Coding sequence: 112-1593 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

> I I I I I 

GCGGAGGGTG CX5TGCGGGCC GCX3GCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 
CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120 
AACGGCACAG CCGAGGC6GT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 
CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 24 0 
CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 
AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACXS6CA ACGTGACCTG CGAGCGCGAG 360 
GTGCCCGGTC CC6ACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 
CAGTCCGA6G CGCACCGGCG CTACTTCG6C GGCACCGAGG ACCGCCTGTC CTCCTrCGCG 480 
CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCXJ CCATGCACCC TCAGGTCAAC 540 
ATCTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCX3GCCGGC C6ACX5AGATC 600 
GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 
CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 
GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 
CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCX3GGGCCCA GCGGCACGCT CAAGGCGG6C 840 
AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 900 
GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 
AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 
AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 
CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 114 0 
CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 
GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 
CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 
CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 138 0 
AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 144 0 
AGCGGCGACA CTCCTGTGGA CTTCTTCTTC 6A6TTCTGCG ACTATAACAA GGTGGCCATC 1500 
AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 
ACCGTGGACC CCGCCTCGCT CTGG6AGTAC TAGG6CCGGC CCGTCCTTCC CCGCCCCTGC 1620 
CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 
GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 
CCTGTCGCCC CTATGGACTC CCCACTCTCX: CCTCCX3CCCG GGTTCCCTAC TCCCCTCGGG 1800 
TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 
CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACX3G CTCTGAGCCT TATTTCTCTG 1920 
GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 
TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 2040 
CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 
CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 
CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 
ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 
GGGCCGTCTT CCT CCTG TCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 
CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 
GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 
CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 
GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 
CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 
TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 
TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 



Seq ID No: 119 Protein aecnieneei 
Protein Accession #i NP_003079.l 

1 11 21 31 41 51 

> I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKKQIWTLEQ PPDEAGSAAV 60 

CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YFGGTEDRIiS 120 

CFAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHI*SARPA DEIAVDRDVP WGVDSLITLA 180 

FQDQRYSVC3T ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 24 0 

KAGKATKVGK DELFALEQSC AQWLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 

DTKKCAFRTH TGKYWTIiTAT GGVQSTASSK WASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 

GQLAASVBTA GDSELFLMKL INRPllVPRG EHGFIGCRKV TGTLDANRSS YDVFQIiEFND 420 

GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FPPEPCDYNK VAIKVGGRYL KGDHAGVLKA 480 
SAETVDPASIi WEY 

Seq ID NO: 120 DWA sequence 

Nucleic Acid Accession #: NM_006404.1 

Coding sequence: 25-741 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

CAGGTCCGGA GCCTCAACTT CAGGAJGTTG 
GGCTGGGCCT TTTGTAGCCA AGACGCCTCA 
ATCTCCTACT TCCGCGACCC CTATCACGTG 



31 41 51 

I I I 

ACAACATTGC TGCCGATACT GCTGCTGTCT 60 
GATGGCCTCC AAAGACTTCA TATGCTCCAG 120 
TGGTACCAGG OCAACGCGTC GCTG6GGGGA 180 
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CACCTAACGC ACGTGCTGGA AGGCCCAGAC ACCAACACCA CGATCATTCA GCTGCAGCCC 240 

TTGCAGGAGC CCGAGAGCTG GGCGCGCACG CAGAGTGGCC TGCAGTCCTA CCTGCTCCAG 30 0 

TTCCACGGCC TCGTGCGCCT GGTGCACCAG GAGCGGACCT TGGCCTTTCC TCTGACCATC 360 

CGCTGCTTCC TGGGCTGTGA GCTGCCTCCC GAGGGCTCTA GAGCCCATGT CTTCTTCGAA 420 

GTGGCTGTGA ATGGGAGCTC CTTTGTGAGT TTCCGGCCGG AGAGAGCCTT GTGGCAGGCA 4 80 

GACACCCAGG TCACCTCCGG AGTGGTCACC TTCACCCTGC AGCAGCTCAA TGCCTACAAC 54 0 

CGCACTCGGT ATGAACTGCG GGAATTCCTG GAGGACACCT GTGTGCAGTA TGTGCAGAAA 600 

CATATTTCCG CGGAAAACAC GAAAGGGAGC CAAACAAGCC GCTCCTACAC TTCGCTGGTC 660 

CTGGGCGTCC TGGTGGGCGG TTTCATCATT GCTGGTGTGG CTGTAGGCAT CTTCCTGTGC 720 

ACAGGTGGAC GGCGATGT TA A TTACTCTCC AGCCCCGTCA GAAGGGGCTG GATTGATGGA 780 

GGCTGGCAAG GGAAAGTTTC AGCTCACTGT GAAGCCAGAC TCCCCAACTG AAACACCAGA 840 

AGGTTTGGAG TGACAGCTCC TTTCTTCTCC CACATCTGCC CACTGAAGAT TTGAGGGAGG 900 

GGAGATGGAG AGGAGAGGTG GACAAAGTAC TTGGTTTGCT AAGAACCTAA GAACGTGTAT 960 

GCTTTGCTGA ATTAGTCTGA TAAGTGAATG TTTATCTATC TTTGTGGAAA ACAGATAATG 1020 

GAGTTGGGGC AGGAAGCCTA TGCGCCATCC TCCAAAGACA GACA6AATCA CCTGAGGCGT 1080 

TCAAAAGATA TAACCAAATA AACAAGTCAT CCACAATCAA AATACAACAT TCAATACTTC 1140 

CAGGTGTGTC AGACTTGGGA TGGGACGCTG ATATAATAGG GTAGAAAGAA GTAACACGAA 1200 

GAAGTGGTGG AAATGTAAAA TCCAAGTCAT ATGGCAGTGA TCAATTATTA ATCAATTAAT 1260 
AATATTAATA AATTTCTTAT ATTT 

Seq ID No: 121 Protein sequence; 
Protein Accession #: NP_006395.1 

1 11 21 31 41 51 

I I I I I i 

MLTTLLPILL LSGWAFCSQD ASDGIiQRLHM LQISYFRDPY HVWYQGNASL GGHLTHVIiEG 60 

PDTNTTIIQL QPLQEPESWA RTQSGIjQSYL LQFHGLVRLV HQERTLAFPL TIRCFW3CEL 120 

PPEGSRAHVF FEVAVNGSSF VSFRPERALW QADTQVTSGV VTFTIiQQLNA YNRTRYELRE 180 
FLEDTCVQYV QKHISAENTK GSQTSRSYTS LVLGVLVGGF IIAGVAVGIF LCTGGRRC 

Seq ID NO: 122 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 2-505 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGAGAAGCTG GGAGAGACAC CACTTGTCCC TGAACAAGAC AATTCAGTAA CATCTATTCC 60 

TGAGATTCCT CGATGGGGAT CACAGAGCAC GATGTCTACC CTTCAAATGT CCCTTCAAGC 120 

CGAGTCAAAG GCCACTATCA CCCCATCAGG GAGCGTGATT TCCAAGTTTA ATTCTACGAC 180 

TTCCTCTGCC ACTCCTCAGG CTTTCGACTC CTCCTCTGCC GTGGTCTTCA TATTTGTGAG 240 

CACAGCAGTA GTAGTGTTGG TGATCTTGAC CATGACAGTA CTGGGGCTTG TCAAGCTCTG 300 

CTTTCACGAA AGCCCCTCTT CCCAGCCAAG GAAGGAGTCT ATGGGCCCGC CGGGCCTGGA 360 

GAGTGATCCT GAGCCCGCTG CTTTGGGCTC CAGTTCTGCA CATTGCACAA ACAATGGGGT 420 

GAAAGTCGGG GACTGTGATC TGCGGGACAG AGCAGAGGGT GCCTTGCTGG CGGAGTCCCC 480 

TCTTGGCTCT AGTGATGCAT_AGGGAAACAG GGGACATGGG CACTCCTGTG AACAaTTTTT 540 

CACTTTTGAT GAAACGGGGA ACCAAGAGGA ACTTACTTGT GTAACTGACA ATTTCTGCAQ 600 

AAATCCCCCT TCCTCTAAAT TCCCTTTACT CCACTGAGGA GCTAAATCAG AACTGCACAC 660 

TCCTTCCCTG ATGATAGAGG AAGTGGAAGT GCCTTTA6GA TGGTGATACT GGGGGACCGG 720 

GTAGTGCTGG GGAGAGATAT TTTCTTATGT TTATTCGGAG AATTTGGAGA AGTGATTGAA 780 

CTTTTCAAGA CATTGGAAAC AAATAGAACA CAATATAATT TACATTAAAA AATAATTTCT 840 

ACCAAAATGG AAAGGAAATG TTCTATGTTG TTCAGGCTAG GAGTATATTG GTTCGAAATC 900 
CCAGGGAAAA AAATAAAAAT AAAAAATTAA AGGATTGTTG ATAAAA 

Seq ID NO: 123 Protein sequence: 
Protein Accession #: none foimd 

1 11 21 31 41 51 

I I I I I I 

EKIiGETPLVP EQDNSVTSIP EIPRWGSQST MSTLQMSIjQA ESKATITPSG SVISKFNSTT 60 

SSATPQAFDS SSAWFIPVS TAVWIiVXLT MTVLGLVKIK: FHESPSSQPR KBSMGPPGLB 120 
SDPEPAALGS SSAHCTNNGV KVGDCDIiRDR AEGALLAESP LGSSDA 

Seq ID NO: 124 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27-1967 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCT6T CCTC6CGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT G6TGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG 6GCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA G6ACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 4 80 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 54 0 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 66 0 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 84 0 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACX3ACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCOGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGA6AAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CX3GGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGT6TGGG TG7UVAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 204 0 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAQGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 222 0 

CCGAGCGGGT AGGAGA6TTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

C7UUVGGCTGG CTTCCACCAT CCAGGT6CAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2560 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 264 0 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 300 0 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 306 0 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 312 0 

AGAATGGTAC TTAGGGATG6 AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 324 0 

TTGTTTCCTT TATATAT6TA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAQACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GC3VCGAAGGG CCTGGCA6GC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID No: 125 Protein aeouenc^i 
Protein Accession #: NP_006491.1 



1 11 21 31 41 51 

I I I I I I 

MGIiPRIiVCAP LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNIiSHV 60 

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RI^LQDRGAT IiALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKBEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCBLNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYBC QAWNLDTMIS LLSEPQBLLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEPQWLRB ETDQVLERGP VIjQLHDLKRE AGGGYRCVAS VPSIPGLNRT 4 20 

QLVKIAIPGP PWMAPKERKV WVKENMVLNL SCBASGHPRP TISWNVNGTA SEQDQDPQRV 4 80 

LSTIiNVLVTP EIiIiETGVECT ASNDLGKNTS ILPLELVIU.T TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPBSRGV VIVAVlVCIIi VLAVLGAVLY FLYRXGiOiPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPBE MGLIiQGSSGD KRAPGDQGEK YIDLRH 



Seq ID NO: 126 DNA aeouenee 

Nucleic Acid Accession #: NM 001955.1 
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Coding sequence: 337-975 (underlined sequences correspond to start and stop'codons) 



1 11 21 31 41 51 

I I I I I I 

GGAGCTGTTT ACCCCCACTC TAATAGGGGT TCT^TATAAA AAGCCGGCAG AGAGCTGTCC 60 

AAGTCAGACG CGCCTCTGCA TCTGCGCCAG GCGAACGGGT CCTGCGCCTC CTGCAGTCCC 120 

AGCTCTCCAC CACCGCCGCG TGCGCCTGCA GACGCTCCGC TCGCTGCCTT CTCTCCTGGC 180 

AGGCGCTGCC TTTTCTCCCC GTTAAAGGGC ACTTGGGCTG AAGGATCGCT TTGAGATCTG 240 

AGGAACCCGC AGCGCTTTGA GGGACCTGAA GCTGTTTTTC TTCGTTTTCC TTTGGGTTCA 3 00 

GTTTGAACGG GAGGTTTTTG ATCCCTTTTT TTCAGAATGG ATTATTTGCT CATGATTTTC 360. 

TCTCTGCTGT TTGTGGCTTG CCAAGGAGCT CCAGAAACAG CAGTCTTAGG CGCTGAGCTC 420 

AGCGCGGTGG GTGAGAACGG CGGGGAGAAA CCCACTCCCA GTCCACCCTG GCGGCTCCGC 480 

CGGTCCAAGC GCTGCTCCTG CTCGTCCCTG ATGGAT7\AAG AGTGTGTCTA CTTCTGCCAC 540 

CTGGACATCA TTTGGGTCAA CACTCCCGAG CACGTTGTTC CGTATGGACT TGGAAGCCCT 600 

AGGTCCAAGA GAGCCTTGGA GAATTTACTT CCCACAAAGG CAACAGACCG TGAGAATAGA 660 

TGCCAATGTG CTAGCCAAAA AGACAAGAAG TGCTGGAATT TTTGCCAAGC AGGAAAAGAA 720 

CTCAGGGCTG AAGACATTAT GGAGAAAGAC TGGAATAATC ATAAGAAAGG AAAA6ACTGT. 780 

TCCAAGCTTG GGAAAAAGTG TATTTATCAG CAGTTAGTGA GAGGAAGAAA AATCAGAAGA 840 

AGTTCAGAGG AACACCTAAG ACAAACCAGG TCGGAGACCA TGAGAAACAG C6TCAAATCA 90 0 

TCTTTTCATG ATCCCAAGCT GAAAGGCAAG CCCTCCA6AG AGCGTTAT6T GACCCACAAC 960 

CGAGCACATT Q GTGA CAGAC TTCGGGGCCT GTCTGAAGCC ATAGCCTCCA CGGAGAGCCC 102 0 

TGTGGCC6AC TCTGCACTCT CCACCCTGGC TGGGATCAGA GCAGGAGCAT CCTCTGCTGG 1080 

TTCCTGACTG GCAAAGGACC AGCGTCCTCG TTCAAAACAT TCCAAGAAAG GTTAAGGAGT 1140 

TCCCCCAACC ATCTTCACTG GCTTCCATCA GTGGTAACTG CTTTGGTCTC TTCTTTCATC 1200 
TGGGGATGAC AATGGACCTC TCAGCAGAAA CACACAGTCA CATTCGAATT C 

Seq ID No: 127 Protein sequence: 
Protein Accession #: NP_001946.1 



1 11 21 31 41 51 

I 1 I I I I 

MDYLLMIFSIi LFVACQGAPB TAVLGAEIiSA VGENGGEKPT PSPPWRLRRS KRCSCSSLMD 60 

KECVYFCHUD IIWVNTPEHV VPYGLGSPRS KRALENliLPT KATDRENRCQ CASQKDKKCW 120 

NFCQAGKEIiR AEDIMBKDWN NHKKGKDCSK LGKKCIYQQL VRGRKIRRSS BEHIiRQTRSB IBO 
TMRNSVKSSF HDPKItKGKPS RBRYVTHNRA HW 



Seq ID HO: 128 DMA sequence 

Nucleic Acid Accession #: NM__001721.1 

Coding sequence: 34-2061 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GCAAGCACGG AACAAGCTGA GACGGATGAT AAT ATGG ATA CAAAATCTAT TCTAGAAGAA 60 

CTTCTTCTCA AAAGATCACA GCAAAAGAAG AAAATGTCAC CAAATAATTA CAAAGAACGG 120 

CTTTTTGTTT TGACCAAAAC AAACCTTTCC TACTATGAAT ATGACAAAAT GAAAAGGG6C 180 

AGCAGAAAAG GATCCATTGA AATTAAGAAA ATCAGATGTG TGGAGAAAGT AAATCTCGAG 240 

GAGCAGACGC CTGTAGAGAG ACAGTACCCA TTTCAGATTG TCTATAAAGA TGGGCTTCTC 300 

TATGTCTATG CATCAAATGA A6AGAGCCGA AGTCAGTGGT TGAAAGCATT ACAAAAAGAG 360 

ATAAGGGGTA ACCCCCACCT GCTGGTCAAG TACCATAGTG GGTTCTTCGT GGACGGGAAG 420 

TTCCTGTGTT GCCAGCA6AG CTGTAAA6CA GCCCCAGGAT GTACCCTCTG G6AAGCATAT 480 

GCTAATCTGC ATACTGCAGT CAATGAAGAG AAACACAGAG TTCCCACCTT CCCAGACAGA 540 

GT6CTGAAGA TACCTCGGGC AGTTCCTGTT CTCAAAATGG ATGCACCATC TTCAAGTACC 600 

ACTCTAGCCC AATATGACAA CGAATCAAAG AAAAACTATG GCTCCCAGCC ACCATCTTCA 660 

AGTACCAGTC TAGCGCAATA TGACAGCAAC TCAAAGAAAA TCTATGGCTC CCAGCCAAAC 720 

TTCAACATGC AGTATATTCC AAGGGAAGAC TTCCCTGACT GGTGGCAAGT AAGAAAACTG 780 

AAAAGTAGCA GCAGCAGTGA AGATGTTGCA AGCAGTAACC AAAAAGAAAG AAATGTGAAT 84 0 

CACACCACCT CAAAGATTTC ATGGGAATTC CCTGAGTCAA GTTCATCTGA AGAAGAGGAA 900 

AACCTGGATG ATTATGACTG GTTTGCTGGT AACATCTCCA GATCACAATC TGAACAGTTA 960 

CTCAGACAAA AGGGAAAAGA AGGAGCATTT ATGGTTAGAA ATTCGAGCCA AGTGGGAATG 1020 

TACACAGTGT CCTTATTTAG TAAGGCTGTG AATGATAAAA AAGGAACTGT CAAA CATTAC 1080 

CACGTGCATA CAAATGCTGA GAACAAATTA TACCTGGCAG AAAACTACTG TTTTGATTCC 1140 

ATTCCAAAGC TTATTCATTA TCATCAACAC AATTCAGCAG GCATGATCAC ACGGCTCCGC 1200 

CACCCTGTGT CAACAAAGGC CAACAAGGTC CCCGACTCTG TGTCCCTGGG AAATGGAATC 1260 

TGGGAACTGA AAAGAGAAGA GATTACCTTG TTGAAGGAGC TGGGAAGTGG CCAGTTTGGA 1320 

GTGGTCCAGC TGGGCAAGTG GAAGGGGCAG TATGATGTTG CTGTTAAGAT GATCAAGGAG 1380 

GGCTCCATGT CAGAAGATGA ATTCTTTCAG GAGGCCCAGA CTATGATGAA ACTCAGCCAT 1440 

CCCAAGCTGG TTAAATTCTA TCGAGTGTGT TCAAAG6AAT ACCCCATATA CATAGTGACT 1500 

GAATATATAA GCAATGGCTG CTTGCTGAAT TACCTGAGGA GTCACGGAAA AGGACTTGAA 1560 

CCTTCCCAGC TCTTAGAAAT GTGCTACGAT GTCTGTGAAG GCATGGCCTT CTTGGAGA6T 1620 

CACCAATTCA TACAC06GGA CTTGGCTGCT CGTAACTGCT TGGTGGACAG AGATCTCTGT 1680 

GT6AAAGTAT CTGACTTTGG AATGACAAGG TATGTTCTTG ATGACCAGTA TGTCAGTTCA 1740 



239 



wo «2/«79492 



PCT/US02/049I5 



GTCGGAACAA AGTTTCCAGT CAAGTGGTCA 
AGCAGCAAGT CAGACGTATG GGCATTTGGG 
AAGCAGCCCT ATGACTTGTA TGACAACTCC 
AGGCTTTACC GGCCCCACCT GGCATCGGAC 
CACGAGCTTC CAGAAAAGCG TCCCACATTT 
CGGGAAAAAG ACAAGCA TTG AA GAAGAAAT 
CTGGCCAGCA TTTTCATTCA TTTTAAGGAA 
TTTTAATAGT GTTCTCTGTA TTGTCTATTA 
ATTCCCTTGA AATTTAQATC AAATTAGTAA. 
TTTCCAGCCT ATAGCAGAAG CACATTTTCA 
AAAGACTGAG CAGAACTGAA AAATTACTTA 
ATTGTCACAA CAATTAAATA TACTACCAAG 

Seq ID No: 129 Protein aemi^neg.. 
Protein Accession #: NP 001712.1 



GCTCCAGAGG TGTTTCATTA CTTCAAATAC 1800 

ATCCTGATGT GGGAGGTGTT CAGCCTGGGG 1860 

CAGGTGGTTC TGAAGGTCTC CCAGGGCCAC 1920 

ACCATCTACC AGATCATGTA CAGCTGCTGG 1980 

CAGCAACTCC TGTCTTCCAT TGAACCACTT 204 0 

TAGGAGTGCT GATAAGAATG AATATAGATG 2100 

AGTAGGAAGG CATAAGTAAT TTTAGCTAGT 2160 

TTTAGAAATG AACAAGGCAG GAAACAAAAG 2220 

TTTTGTTTTA TGCTGCTCCT GATATAACAC 2280 

GACTGCAATA TAGAGACTGT GTTCATGTGT 234 0 

TTGGATATTC ATTCTTTTCT TTATATTGTC 2400 
TACAGAAATG TGGAAAAAAA AAACCG 



1 11 21 

ZU MPTKSILEEL LLKRSQQKKK MSPNNYKERL 
RCVEKVNLEE QTPVERQYPP QIVYKDGLI.Y 
HSGFFVDGKP LCCQQSCKAA PGCTLWEAYA 
KMDAPSSSTT LAQYDNESKK NYGSQPPSSS 
PDWWQVRKLK SSSSSEDVAS SNQKERNVNH 

25 ISRSQSEQLL RQKGKEGAFM VRNSSQVGMY 

LAENYCFDSI PKLIHYHQHN SAGMITRLRH 
KELGSGQFGV VQLGKWKGQY DVAVKMIKEG 
KEYPIYIVTB YISNGCLLNY LRSHGKGLEP 
NCLVDRDLCV KVSDPGMTRY VIiDDQYVSSV 

30 LMWEVFSLGK QPYDLYDNSQ WLKVSQGHR 
QLLSSIEPLR EKDKH 



31 41 51 

I i I 

FVIjTKTNI*SY YEYDKMKRGS RKGSIEIKKI 60 

VYASNEESRS QWLKALQKEI RGNPHLIiVKY 120 

NLHTAVNEEK HRVPTFPDRV LKIPRAVPVL 180 

TSLAQYDSNS KKIYGSQPNP NMQYIPREDP 240 

TTSKISWEFP ESSSSEEEEN LDDYDWFAGN 300 

TVSLFSKAVN DKKGTVKHYH VHTNAENKLY 360 

PVSTKANKVP DSVSLGNGIW ELKREEITLL 420 

SMSEDEFFQE AQTMMKLSHP KLVKFYGVCS 4 80 

SQLIiEMCYDV CEGMAPLESH QPIHRDLAAR 54 0 

GTKFPVKWSA PEVFHYPKYS SKSDVWAFGI 600 

LYRPHIiASDT lYQIMYSCWH ELPEKRPTFQ 660 



35 



Seq ID NO: 130 DMA sequence 

Nucleic Acid Accession #: NM_012072.2 

Coding sequence: 149-2107 (underlined sequences correspond to start and stop codons) 



40 



45 



50 



55 



60 



65 



70 



75 



AAAGCCCTCA 
CCCCTTGGGG 
TCCCGCAGAG 
GCTGCTGCTC 
CGTGGGGACC 
CCACTGCAAC 
CGTCCAGC6A 
CAAGTTCTGG 
GAAGGGCTTC 
GCTCCGGAAC 
GCTCCTTCCC 
CGGAAGTAAC 
GGCCCTGGGG 
CTTGGAGGCT 
CGAGACTCAG 
CAGCTCGGGC 
CCACCAGGAC 
CCGGCTGCTG 
TCGTGGGGGG 
CCAAGGGTAC 
CTCCCCCTGT 
TGGCTATGAG 
GGGTCGCTCG 
TGAGGAGG6C 
TGTGGGCCCG 
CTGTGGCTGC 
TGTGTCTCTG 
GAGCACCGTG 
GGCTACACCC 
ACTCAAGAT6 
CGCCACAGCT 
AAACAACGAT 
GGCCATCCTA 
GAAGAGGGAG 
TCCAGAGCGA 
CTG CTGAA AG 



11 

I 

GCCTTTGTGT 
CCCAGCTGGG 
GGCCACACAG 
CTGACCCAGC 
GCCTGCTACA 
CAGAACGGGG 
GTACTGGCCC 
ATTGGGCTCC 
AGCTGGGTGG 
TCGTQCATCT 
AACCGCCTGC 
ATTGAGGGCT 
GGCCCAGGTC 
GTGCCCTTTG 
AGTCATTATT 
CCCCTCTGTG 
TGCTTTGAAG 
GATGACCTGG 
GCCACGTGCG 
CAGCTGGACT 
GCCCAGGAGT 
CCGGGCGGTC 
CCTTGCGCCC 
TACGTCCTGG 
GGGGGCCCCC 
CTGCCAGGCT 
GGACCACCAT 
CCCCGCGCTG 
ACCACAAGTA 
CTGGCCCCCA 
GCCTCTGGCC 
GGCACTGACG 
CTCCTGCTGG 
GAGAAGAAGG 
GCTGAGAGCA 
TGAG6TGGCC 



21 
I 

CCTTCTCTGC 
AGCCGAGATA 
AGACCGGGAT 
CCGGGGCGGG 
CGGCCCACTC 
GCAACCTGGC 
AGCTCCTGAG 
AGCXMIGAGAA 
GCGGGGGGGA 
CCAAGCGCTG 
CCAAGTGGTC 
TCGTGTGCAA 
AGGTGACCTA 
CCTCTGCGGC 
TCCTGTGCAA 
TCAGCCCCAA 
G6GGGGATGG 
TGACCTGTGC 
TCCTGGGACC 
CGAGTCAGCT 
GTGTCAACAC 
CTGGAGAGGG 
A6GGCTGCAC 
CCGG6GAGGA 
TCTGCGACAG 

CTGGGCCCCC 
CAACAGCCAG 
GACCTTCGCT 
GTGGGTCCTC 
CCCAGGAGCC 
GGCAAAAGCT 
CCCTGGCTCT 
AGAAGAAGCC 
GGGCCATGGA 
CTAGAGACAC 



31 
I 

GCCGGAGTGG 
GAAGCTCCTG 
^GCCACCTCC 
GACGGGAGCT 
GGGCAAGCTG 
CACTGTGAAG 
GCGGGAGGCA 
GGGCAA6T6C 
6GACACGCCT 
TGTGTCTCTG 
TGAGGGCCCC 
GTTCAGCTTC 
CACCACCCCC 
CAATGTAGCC 
GGAGAAGGCC 
6TATGGCTGC 
CTCCTTCCTC 
CTCTCGAAAC 
CCATGGGAAA 
GGACTGTGTG 
CCCTGGGGGC 
GGCCTGTCAG 
CAACACAGAT 
CGGGACTCAG 
CTTGTGCTTC 
CCCAAATGGG 
CGATGAGGAG 
TCCCACAA6G 
GTCATCTGAC 
AGGCGTCTGG 
TGCAGGTGGG 
GCTTTTATTC 
GGGGCTACTG 
CCAGAATGCG 
GAACCAGTAC 
TAGAGTCACC 



41 
I 

CTGCAGCTCA 
TCGCCGCTGG 
ATGGGCCTGC 
GACACGGAGG 
AGCGCTGCCG 
AGCAAGGAGG 
GCCCTGACGG 
CTGGACCCTA 
TACTCTAACT 
CTGCTGGACC 
TGTGGGAGCC 
AAAGGCATGT 
TTCCAGACCA 
TGTGGGGAAG 
CCCGAT6TGT 
AACTTCAACA 
TGCGGCTGCC 
CCTTGCAGCT 
AACTACACGT 
GACGTGGATG 
TTCCGCTGCG 
GATGTGGATG 
GGCTCATTTC 
TGCCAGGACG 
AACACACAAG 
GTCTCTTGCA 
GACAAAGGA6 
GGCCCCGAGG 
GCCCCCATCA 
AGGGAGCCCA 
GACTCCTCCG 
TACATCCTAG 
GTCTATG6CA 
GCAGACAGTT 
AGTCCGACAC 
AGCCACCATC 



51 
I 

CCCCTCAGCT 
GCTTCTCGCC 
TGCTGCTGCT 
CGGTGGTCTG 
AGGCCCAGAA 
AGGCCCAGCA 
CGAGGATGAG 
GTCTGCCGCT 
GGCACAAGGA 
TGTCCCAGCC 
CAGGCTCCCC 
GCCG6CCTCT 
CCAGTTCCTC 
GTGACAAGGA 
TC6ACTGGGG 
AT6GGGGCTG 
GACCAGGATT 
CCAGCCCATG 
GCCGCTGCCC 
AATGCCAGGA 
AATGCTGGGT 
AGTGTGCTCT 
ACTGCTCCTG 
TGGATGAGTG 
GGTCCTTCCA 
CCATGGGGCC 
AGAAAGAAGG 
GCACCCCCTU^ 
CATCTQCCCC 
GCATCCATCA 
TGGCCACACA 
GCACC6TGGT 
AGCGGAGAGC 
ACTCCTGGGT 
CTGGGACAGA 
CTCA6AGCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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TGAACTCCCC ATTCCAAAGG GGCACCCACA 
CAAACAATTG TAAGTCTCCT CCTTAAAGGC 
TGTTTGATGT TCCTGAAGTG GAAGCTGTGT 
TCTATAATGA TTGTTACTCC CCCTCCCTTT 
5 GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC 
ATCTAAGAGG AAAAGGTGAG TTGCTCATGC 
CCTAGGATGA AAACTAAATC AATTAATTAT 
TCAAAGGGAA CATGTTCGGA CTGGAAACAT 
AGCACAAGTC TTGCTAAATG TGATACTGTT 
10 TAACCTCTTA GGTGGCAAGG AGGCAGGAAG 
CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA 
CAGGTGTTTG TGAAGTCACA TAATCTACGG 
CACAGATACT TGAATTAATT CATCCAAATG 
TGTGATCAAC ACTAACAAGG AAACAAATTC 
15 CCTCAGACAC CCTGCCTGTG GCCCCGCCTC 
CGA6CTCAGA CAGAGGAA6C CCTGCAGAAA 
TGAACGGGAG ATGATGCACT GTGTTTTGAA 
TCATAGTCCA CAGTTGATGC A6CATCCTGA 
CACACCAAGT AGGGAGCTAG TCAGGCAGTT 
20 TTCCTTAAAA TTGGGGGTAA GGAGGGAA6G 
TTTTTACAGC AAAAACTGCT CAAAGCCATT 
TTGCAAATAT TTCTCCCTAT GATAATGCAG 
TCTCTCTCAC ACACACACAC ACACACACAC 
CCTGGGGCAC TGGAACACAT TCCTGGGGGT 
25 TGAGTATCTC TGGGAGGCCT CATGTCTCCT 
ACAGACAGAG GAAATGTGTC TCCCTCCAAG 
GGTTTTGCCT TAGCAATGCA TCGGTCTCTG 
CAAGGTGCAG GGTTAATACT CTTGCCAGTT 
TTTTAATAGA AAACTAAAGG GGCAGGGGAA 
30 TCGATGGGGC ATTTGGAACT TCTTTTTAAA 
AACTCTGGTG TTTAACACTT AAGGGAGACA 
GGCCACGAGA CTCTAGGTGA TGTGTGAAGC 
CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC 
CACTTAAATA AATGCAAATG CAACATTTCT 
35 TCATTTGGGG TGAAGGAGAC ATTTCTGTCC 
GTATGATTCC TGGGATCCAA CQAGCCCTCC 
GCCCAGGCCC ATCGTCTGTT CTCTGAATGC 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA 
ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT 
40 CCCACCACGG GGGGAGAGCC AGCAACCCAA 
AACCACTGGG CTCAAACACG TGCTTTATTC 
ATGGAAATTC TTGTTTGGGG GATCTTGGGG 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT 
CACAGTGGGG AATCCAAGGG TCACAGTATG 
45 TCTCGCTAGA CACAGTGTTT CTGCCCAGGT 
CATGGGGACG GGGGAAGTTT TCACTTGGAG 
CCAAATAGGT CAATAATTCT GGGAGACTCT 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA 
CACCCAGCTC GCCATGCCTA CTCATTCCTG 
50 CTTCTTTGTC ATTTGAGAAA GGATGCAGGA 
CAGAAAAACC AGGGCAGGAC AGTTATCGAC 
TAGAGGGACT CCACCCCTGC TCAACAGCTT 
CTCTGCCTTC 6GTGGCCCAC ACACCTAAGC 
AACACATCTA CX3TGTAGCAC TACGACGTTA 
55 AGGCTCTGAT TAAGGATGTG GGGAAGTGGG 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG 
TGTGTTATGT CCATTTTGCA GGATGAACTG 
TTG Ci ' TT GTC TTTTCCATCC TCATCACAAG 
60 TCTTTCGATG GATGGAGATG ATCATTAGGT 
ATTTCTGTGA AAACTAGGAG AACAGAGATG 
TAACACAGTC TTTTTAAAAC TAACATAGGA 
TCTCCATTGT CTAAATCAGG AAAACAGGAA 
TTAATGCCCC CTACATATTT CCATCACCTT 
65 TATGATCCCA GAAAACATCT GTCTCTACTT 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG 
CTAATCAAAG ACACTATTTT CATACTAGAT 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG 
70 TTCAGCAGAT TTTGCXCACT ATTCCTCTGA 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT 
GTAATCACTT CATGAATGCT AAATGAGAAT 
TTTGTTTGAC TAATTCTGGA ATTACAAGAT 
ATGTTTCCCA J^ACTGTGAGG AGGGAAGGCT 
75 CAAAATGGTG CTTTGAGGGT CAGCCTTTAG 
TGTGTTATGT GCCTATCCTA ATAAACTCTT 



TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

GACATCCTCC AGT^TGGCCA GAAGTGCAAT 2700 

TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

TACTGAGGTT ACCACACACT TGACTACGGA 2940 

AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

GATTTTAAAT CCTGAAGTGT 6GGTGGCGCA 3240 

TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TCGATAGTGT GCACTCTTTC TCTCTCTCTC 34 80 

ACACACACAC AGAGACACGG CACCATTCTG 354 0 

CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

TTGAAATATA GATGCTATGG TTCAGATTGT 3 840 

GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CCCTCTGGQC CTTGAAAATC CTTGCCCTTA 4200 

TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

CTACAGTAGT GGGTAAACAA ATGCCCACCG 4 680 

CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 

GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4 860 

ATGGACACCA AGACAATGJ^A GATTTGTTGT 4920 

TGGAAAAAAC TGAATATATT CAGGACCAAC 4 980 

GCAGACAATG Ti\AAGAGAGA ACATCTCACA 5040 

AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

GTCATCX3TCA TTGCCATAGC ATCATGATGC 5340 

TGTTTGG6TA ATGTGGGGAT GAACTGCATG 5400 

CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

GGA6CAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

AACACAGCTT TCTAGCAGCT GCAAAATGGT 594 0 

GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

AGATGATAAT CCGAATTCTC CATATATTCA 6120 

TCCTGAGACA AATACTCACT 6AAGGGCTTG 6160 

TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

GAAGGT6CAG CTTTGTTGTC CTTTGAGCTT 6660 
AAACACATT 
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Seq ID No: 131 Protein aecruence ; 
Protein Accession # : NP 036204 . 1 



1 11 21 

I I I 

MATSMGLLLL LLLLLTQPGA GTGADTEAW 

ATVKSKEBAQ HVQRVLAQLL RREAAIiTARM 

EDTPYSNWHK ELRNSCISKR CVSLLIiDLSQ 

KFSFK6MCRP IiALGGPGQVT YTTPFQTTSS 

KEKAPDVFDW GSSGPIjCVSP KYGCNFNNGG 

ASRNPCSSSP CRGGATCVIiG PHGKNYTCRC 

TPGGFRCECW VGYEPGGPGE GACQDVDECA 

DGTQCQDVDE CVGPGGPLCD SLCFNTQGSP 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP 

SGVWREPSIH HATAASGPQE PAGGDSSVAT 

liGLIiVYRKRR AKRBEKKBKK PQNAADSYSW 



31 41 51 

I I I 

CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

SLEAVPFASA ANVACGEGDK DETQSHYPLC 240 

CHQDCPEGGD GSPLCGCRPG FRLLDDLVTC 300 

PQGYQIjDSSQ LDCVDVDECQ DSPCAQECVN 360 

LGRSPCAQGC TNTDGSFHCS CBEGYVLAGE 42 0 

HCGCIiPGWVIi APNGVSCTMG PVSIiGPPSGP 480 

KATFTTSRPS LSSDAPITSA PLKMLAPSGS 540 

QNNDGTDGQK LLLFYILGTV VAILLLIiALA 600 
VPERAESRAM ENQYSPTPGT DC 



Seq ID NO: 132 DMA sequence 

Nucleic Acid Accession #: NM_000963.1 

Coding sequence: 135-1949 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGCGC 60 

CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAAG CCTACCCCCG CGCCGCGCCC 120 

TGCCCGCCGC TCGG^TGCTC GCCCGCGCCC TGCTGCTGTG CGCGGTCCTG GCGCTCAGCC 180 

ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AGGTGTATGT ATGAGTGTGG 240 

GATTTGACCA GTATAAGTGC GATTGTACCC GGACAGGATT CTATGGAGAA AACTGCTCAA 300 

CACCGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 3 60 

ACATACTTAC CGACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 

ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480 

ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TATTATACTA 540 

GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC 600 

AGCTTCCTGA TTCAAATGAG ATT6TGGAAA AATTGCTTCT AAGAAGAAAG TTCATCCCTG 660 

ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 

TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTGGGCCAT GGG6TGGACT 780 

TAAATCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTCAAGGATG 840 

GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 

AGGCAGAQAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 

ACAGAGTATG CGATGTGCTT AAACAGGAGC ATCCTGAATG GGGTGATGAG CAGTTGTTCC 1080 

AGACAAGCAG GCTAATACTG ATAGGAGAGA CTATTAAGAT TGTGATTGAA GATTATGTGC 1140 

AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 1200 

AATTCCAGTA CCA AAAT CGT ATTGCTGCTG AATTTAACAC CCTCTATCAC TGGCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 1380 

TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 144 0 

CTTCCATTGA CCAGA6CAGG CAGATGAAAT ACCAGTCTTT TAATGAGTAC CGCAAACGCT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 1560 

AGTTGGAAGC ACTCTATGGT GACATCGATG CTGTGGAGCT GTATCCTGCC CTTCTGGTAG 1620 

AAAAGCCTCG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAGTTGGA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT 6TTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 

TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1800 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCAGAO CTCATTAAAA 1860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 

TACTAAAAGA ACGTTCGACT GAACTG TAG A AGTCTAATGA TCATATTTAT TTATTTATAT 1980 

GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 

AACATCTTCT GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT GAATTTCAAC 2220 

TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 2280 

AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAGAA 2340 

GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 

A GGTA TCAGT GCATTATTAA ATGAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATG AAACAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2520 

ACCTGTAAAA G CTT GTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580 

GC TGTC TTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA 2640 

TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCAC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTGGTAACAT 2820 

GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2880 

CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 2940 
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ACTGCAGGCC TGGTACTCAG ATTTTGCTAT 
ATAACC3ATAT GTTTTCTCAG ATTTTCTGTT 
TTGCAAAAGT AGCAATGACC TCATAAAATA 
CATTAATTTT ATCTCAGTCT TGAAGCCAAT 
ACCTGCATGC TGTTCCTTTT CTTTTCTTCT 
CTCATCACTT CGTTTCTCCT ATTTTGTTTT 
TGGACTCTGC CTATATTTTC TTACCTGAAC 
AGGACTGCTA TTTAGCTCCT CTTAAGAAGA 
AATAGTATAC ACTTATTTTA AGTGAAAAGC 
ATCTGTAACC AAGATGGATG CAAAGAGGCT 
TGACTGGAAA AAGTTACGTT CCCATTCTAA 
CAAATGATAT CTAAGTAGTT CTCAGCAATA 
ATCTCATTGT CACTGACATT TAATGGTACT 
TTTATGTCTT ATTAGGACAC TATGGTTATA 
TTTTTTGTTA TGTCACAATC AGTATATTTT 
AACAATCCAA AGAAATGATT GTATTAAGAT 
CATATTGAGA TATTTAAGGT TGAATGTTTG 
AAAQAATATT GTCTCATTAG CCTGAATGTG 
GGGATCTGTG GATGCTTCGT TAATTTGTTC 
TCAAGCACTG TGGGTTTTAA TATTTTTAAA 
TATAAATAAT TGAAAAAAAT TTTCTTTTGG 
AAAGATAACT CAGGA6AATC TTCTTTACAA 
AGAAATAGTC AATATGCTTG TATAAAACAC 
GATTTGTTAT TAACATTGAT CTGCTGACAA 
GTTTCAGTGC CTCA6ACAAA T6TGTATTTA 
TGTCTGTTTA TTTTTGTACT ATTTA 

seq ID NO: 133 Protein sequence; 
Protein Accession #: NP_000954.1 



GAGGTTAATG AAGTACCAAG CTGTGCTTGA 3000 

GTACAGTTTA ATTTAGCAGT CCATATCACA 3060 

CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

TCAGTA6GTG CATTGQAATC AAGCCTGGCT 3180 

TTTAGCCATT TTGCTAAGAG ACACAGTCTT 3240 

ACTAGTTTTA AGATCAGAGT TCACTTTCTT 3300 

TTTTGCAAGT TTTCAGGTAA ACCTCAGCTC 33 60 

TTAAAAGAGA AAAAAAAAGG CCCTTTTAAA 3420 

AGAGAATTTT ATTTATAGCT AATTTTAGCT 34 80 

AGTGCCTCAG AGAGAACTGT ACGGGGTTTG 3 540 

TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 

ATAATAATGA CGATAATACT TCTTTTCCAC 366 0 

GTATATTACT TAATTTATTG AAGATTATTA 3720 

AACTGTGTTT AAGCCTACAA TCATTGATTT 3 780 

CTTTGGGGTT ACCTCTCTGA ATATTATGTA 3840 

TTGTGAATAA ATTTTTAGAA ATCTGATTGG 3900 

TCCTTAGGAT AGGCCTATGT GCTA6CCCAC 3960 

CCATAAGACT GACCTTTTAA AATGTTTTOA 4020 

AGCCACAATT TATTGAGAAA ATATTCTGTG 4080 

TCAAACGCTG ATTACAGATA ATAGTATTTA 4140 

GAAGAGGGAG AAAATGAAAT AAATATCATT 4200 

TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 

TGTTCACTGT TTTTTTTAAA AAAAAAACTT 4320 

AACCTGGGAA TTTGGGTTGT GTATGCGAAT 4380 

ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 



1 11 21 31 41 51 

I I I I I I 

MLARAIiLLCA VIALSHTANP CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY GENCSTPEFL 60 

TRIKIiFLKPT PNTVHYILTH PKGPWNWNN IPFLRNAIMS YVLTSRSHLI DSPPTVNADY 120 

GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLIjIiR RKPIPDPQGS 180 

NMMFAFFAQH FTHQFFKTDH KRGPAFTKGL GHGVDIiNHIY GETLARQRKL RLPKDGKMKY 240 

QIIDGEMYPP TVKDTQABMI YPPQVPEHLR FAVGQEVFGL VPGLMMYATI WLRBHNRVCD 300 

VLKQEHPBWG DBQLFQTSRIi ILIGBTIKIV lEDYVQHLSG YHFKIiKPDPE LLFNKQFQYQ 360 

NRIAAEFNTL YHWHPLIjPDT PQIHDQKYNY QQFIYNNSIL liEHGITQFVE SFTRQIAGRV 420 

AGGRNVPPAV QKVSQASIDQ SRQMKYQSPM BYRKRFMIiKP YESFEELTGE KBMSAELEAIi 480 

YGDIDAVBLY PALLVEKPRP DAIFGBTMVB VGAPFSIiKGIi MGNVICSPAY WKPSTFGGEV 540 

GFQIINTASI QSLICaJNVKG CPFTSPSVPD PELIKTVTIN ASSSRSGIiDD INPTVLUCER 600 
STEIi ' 

Seq ID NO: 134 DNA secmence 

Nucleic Acid Accession #: XM_059648.1 

Coding sequence: 35-664 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
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AGGCTGCTGA GACTTCCCTC TAGAATCCTC CAA CATGG AG CCTCTTGCAG CTTACCCGCT 60 

AAAATGTTCC GGGCCCAGAG CAAAGGTATT TGCAGTTTTG CTGTCTATAG TTCTATGCAC 120 

AGTAAC6CTA TTTCTTCTAC AACTAAAATT CCTCAAACCT AAAATCAACA GCTTTTATGC 180 

CTTTGAAGTG AAGGAT6CAA AAGGAAGJ\AC T6TTTCTCT6 6AAAAGTATA AAGGCAAAGT 240 

TTCACTAGTT GTAAACGTGG CCAGTQACTG CCAACTCACA GACAGAAATT ACTTAGGGCT 300 

GAAGGAACTG CACAAAGAGT TTGGACCATC CCACTTCAGC GTGTTGGCTT TTCCCTGCAA 360 

TCAGTTTGGA GAATCGGAGC CCCXKZCCAAG CAAGGAAGTA GAATCTTTTG CAAGAAAAAA 420 

CTACGGAGTA ACTTTCCCCA TCTTCCACAA GATTAAGATT CTAGGATCTG AAGGAGAACC 480 

TGCATTTAGA TTTCTTGTTG ATTCTTCAAA GAAGGAACCA AGGTGGAATT TTTGGAAGTA 54 0 

TCTTGTCAAC CCTGAGGGTC AAGTTGTGAA GTTCTGGAAG CCAGAGGAGC CCATTGAAGT 600 

CATCAGGCCT GACATAGCAG CTCTGGTTAG ACAAGTGATC ATAAAAAAGA AAGAGGATCT 660 

ATGAG AATGC CATTGCGTTT CTAATAGAAC AGAGAAATGT CTCCATGAGG GTTTGGTCTC 720 

ATTTTAAACA tTTTTTTTTT GGAGACAGTG TCTCACTCTG TCACCCAGGC TGGAGTGCAG 780 

TAGTGCGTTC TCAGCTCATT GCAACCTCTG CCTTTTTAAA CATGCTATTA AATGTGGCAA 840 

TGAAGGATTT TTTTTTAATG TTATCTTGCT ATTAAGTGGT AATGAATGTT CCCAGGATGA 900 

GGATGTTACC CAAAGCAAAA ATCAAGAGTA GCCAAAGAAT CAACATGAAA TATATTAACT 960 

ACTTCCTCTG ACCATACTAA AGAATTCAGA ATACACAGTG ACCAATGTGC CTCAATATCT 1020 

TATTGTTCAA CTTGACATTT TCTAGGACTG TACTTGATGA AAATGCCAAC ACACTAGACC 1080 

ACTCTTTGGA TTCAAGAGCA CTGTGTATGA CTGAAATTTC TGGAATAACT GTAAATGGTT 1140 

ATGTTAATGG AATAAAACAC AAATGTTGAA AAATGTAAAA TATATATACA TAGATTCAAA 1200 

TCCTTATATA TGTATGCTTG TTTT6TGTAC AGGATTTTGT TTTTTCTTTT TAA6TACAGG 1260 

TTCCTAGTGT TTTACTATAA CTGTCACTAT GTATGTAACT GACATATATA AATAGTCATT 1320 
TATAAATGAC CGTATTATAA CA 
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Seq ID No: 135 Protein aequenpe? 
Protein Accession #: XP 059648.1 



10 



11 
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I 



41 

I 



51 



MEPLAAYPLK CSGPRAKVFA VLLSIVLCTV TLPLLQLKFL KPKINSFYAF EVKDAKGRTV 60 

SLEKYKGKVS LWNVASDCQ LTDRNYLGIiK ELHKEPGPSH FSVLAFPCNQ TOESEPRPSK 120 

EVESFARKNY GVTFPIFHKI KILGSEGEPA FRFLVDSSKK EPRWNFWKYL VNPEGQWKF 180 
WKPEEPIEVI RPDIAALVRQ VIIKKKEDL 



15 



Seq ID NO: 136 DNA aeouenee 

Nucleic Acid Accession #: NM_003003.1 

Coding sequence: 304-2451 (underlined sequences correspond to start and stop codons) 
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40 
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50 



55 



60 
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70 



75 



CAAGTGCCGT 
CGGTCGAGCT 
AGCGCCCCTC 
CTCCTTGAGG 
TACCATTCTT 
ATCATGGTGC 
ATGGCTGCCT 
GACACT6TGA 
AAGCTGGATG 
TTTGTCCAGA 
GAAACX3TTTT 
AATGAAGATT 
TTTGAAAGTA 
AAGGAAATCA 
CGTTGGAGTC 
CAAGCAGCGT 
GATGCCCTCA 
GATGCCGACC 
ATTAGACTTC 
ATTCTTCGGT 
CAGTCTTTGA 
CCTCAGGTCC 
CCCCTCTACG 
GAGGAAGCCC 
GAGAATACAA 
GGGCTGAACA 
GTGGTGGAGG 
GTATTTCCTG 
TTCCTCATTT 
AAAGAGATTA 
CTGGTCCCCA 
TGGACTGAGA 
ATTCAGATTG 
ATTGTGTTTA 
GGAGCCCACA 
CAGCTGGGCC 
GTGCAGGGTT 
AGCATGCCTG 
CAGGTCTCTT 
TTCAGAGGTT 
GCCACCACCT 
GCCTGCACCT 
CACCCAGCGG 
GGTAAACGTA 
ACTTAACTCA 
GGGCTCTCTT 
ATT6ATGCAA 
GACATCCTCC 
AAGCT6CCTC 
GTCTGTGGAC 
GCG6AGTACC 
G6G6GCTCAG 
AAACATTACT 
TTCTGTATGT 
CACCTGCAGT 
TTGTCTCAGA 
ATTTGCCACT 



CGCCGCGCCC 
ACGGTCGCGG 
GCCACCCCGT 
ATATTCAGTT 
AATGATTATC 
AAAAATACCA 
ATGAAAGGAG 
GTGAATTCAA 
TAGATGCACC 
AAAACTCACT 
CCAATCGGGT 
GGACCTGTTT 
CAGTGGAAAA 
TCGAATACTA 
CGCCTTCCAT 
CCATGGCCGT 
GCAGCCCCAG 
ACATCAAGAG 
GCCAGTGGCT 
TCCTCCGTGC 
CGTGGAGAAA 
TTCAGGATTA 
TGCTCAGGCT 
TGCTGAGATA 
AAGTCTTTGG 
TGCGCCACTT 
CCAACTACCC 
TGCTCTGGAC 
ATGCAGGAAA 
TTCCAGATTT 
AATCTCTGTA 
CCATCTACCA 
TGGATGCCTC 
ACATCTATCA 
GCATCACCTC 
GCGACTACAG 
CCCATGTGAC 
CGTGCGCCGC 
CGCACAAGTG 
CCATGACGAG 
CCTCCAGCCA 
AGTGTGCAGA 
CGACATTGTA 
GTCGTTTGAT 
ATAGCCATAG 
GAAAGAAAAG 
AAAATTTTTC 
AGAGATGGCC 
ATGGCCCGCA 
TTAGGGCCAG 
TTGTCCCAGG 
GAGGGGCTCT 
TTCTCTTTCC 
GAACTTGGGT 
CAGCTCCCAG 
GCCCAGACAG 
TGACACTGTC 



CTTCCCCCTC 
ACGAGTGGAA 
GTCCCAGGCC 
TTGTATGTTT 
CTCAACAAGA 
GTCCCCAGTG 
GTTCCCTACA 
GAGCGAAGAT 
CAGACTGCTG 
GAATTCTCGG 
CATCATTAAT 
TGAACAGTCT 
AATTGCAATG 
CCTTCGCCAA 
CACGCCCTCT 
CGTCATCCCA 
TGCACCTGAG 
ATACCTGGGC 
CCAGGAGACC 
ACGGGATTTT 
GCAGCATCAG 
CTACGCGGGA 
GGGGCAGATG 
CGTTCTCTCC 
TCGGCCTATC 
GTGGAGACCT 
TGAGACACTG 
GCTGGTTAGT 
TGACTACCAG 
CCTGAGTGG6 
CC6GACTGCA 
GTCTGCAAGC 
GTCAGTCATC 
CTCCAAGA66 
TCCGGGTGGG 
CATGGTGGAG 
CAGGTGGCCXS 
CAGCAGCCTT 
TAAAGTGATG 
CCTGGAGTCC 
GTCCCACTCC 
GGGGACGGCC 
CAGACTCCTC 
CCCAAAACTA 
ATTTTGTATA 
TAGTTTCTGT 
CAACGAACTC 
CCTCCTCACC 
CX3CCGCCTCA 
CCCTTGAGGT 
GCCAGACACA 
CAGGGACTCC 
TCCTTTTCAA 
GGGGGGGTTC 
CCCAGTGTA6 
TTCCAGCCAC 
CATGGGGTTT 



CCGCCTCCCC 
CCGAGACTGC 
CGGCCTTTCT 
GAATATCCTC 
CAGGTGTGAG 
AGAGTGTACA 
TGTCCTTTGA 
GGGGCTATTC 
AAGAAGATTG 
GAAC6TACTT 
GAGCATTGCT 
GCAAGTTTAG 
AAACAATATA 
TTAGAAGAAG 
TCAGAGACAT 
GAAGCTGCCC 
CCCGTGGTGG 
GATTTGACTC 
CACAAGGGCA 
AATATTGACA 
GTAGACTACA 
GGCTGGCATC 
GACACCAAAG 
GTAAATGAAG 
AGCTCATGGA 
GGTGTGAAAG 
GGCCGCCTTC 
CCGTTCATTG 
GGTCCTGGAG 
GAGTGCATGT 
GAGGAGCTGG 
6TCTTCAAA6 
ACTTGGGATT 
TCGCCACAAC 
AACAATGTGC 
TCGCCTCTGA 
GGCTTCTACA 
CCCCGGGTGG 
TACTACACCG 
AGCCACAGCG 
AGCTCCATGA 
GCCCCTCCTC 
TCACCTCTAG 
CCTTGGCAGG 
CGTTGTGCAC 
ACCAATTAAA 
CGCATTGTCC 
TGGGACGGAA 
CGGCCCCCAT 
CCTTATCCTC 
CCXilACACCAC 
TGGTGACTCC 
ATCTTTTTGA 
TTCCCGTTTC 
GCCATCTCCT 
TAG6AGGCCG 
TATTAGTAGC 



41 

I 

GGCCCCCTCC 
CCCGCGGAGC 
GACAAGAGCT 
TCACCATGTT 
AGGGTTGCTG 
AATACCCCTT 
TTCCGATGTT 
ATGTCATTGA 
CAGGAGTTGA 
TGCACATTGA 
GCTACACCGT 
ATATTAAATC 
CCAGCAACAT 
AAGGCATAAC 
CTTCATCATC 
TCAAGGAGGG 
GCACCCCTGA 
CGCTGCAGGA 
AAATTCCAAA 
AAGCCAGAGA 
TTCTTGAAAC 
ATCACGACAA 
GCTTGGTGAG 
AACGGCTAAG 
CCTGCCTGGT 
CGCTGCTGCG 
TCATCCTGCG 
ATGACAACAC 
GCCTGCTGGA 
GCGAAGTGCC 
AGAAC6AAGA 
GA6CCCCACA 
TCGACGTGTG 
CACCXrAAAAA 
AGCTCATAGA 
TCTGCAAAGA 
TCCTGCAGTG 
ACGACGTGCT 
AGGTGATCGG 
GCTTCTCCCA 
TCTCCAGGTA 
GGACAGCAGC 
ATAGCAAATA 
TAGTTTTAAC 
AAAATCCAAC 
GGATTGACGT 
ATTAGTGAAT 
GCTGCCAGCT 
GCTTCCCGCC 
TGAGGATTCA 
CCACTGTCTG 
AG6AAAATGC 
TACTTTTTAG 
CTTCC6TGCG 
CTGTGCCCTC 
TCTTGGAACC 
TAAGCAGCAG 



51 



CCGGAACCGG 
CGCCGGTATG 
AGACTTCGGG 
CAGCATAAAG 
TTGCATTGCA 
TGAATTAATT 
CGTGGGCAGT 
AAGGCGCTGC 
TTATGTTTAT 
GGCTTATAAT 
TCACCCTGAA 
TTTCTTTGGT 
TAAAAAAGGA 
CTTTGTGCCC 
CTCCAAGAAA 
GCTGAGTGGT 
CGACAAACTA 
GAGCTGCCTC 
A6ATGAGCAT 
GATCATGTGT 
CTGGACCCCT 
AGATGGGCGG 
AGCGCTCGGG 
GCGATGCGAA 
GGACTTGGAA 
GATCATCGAG 
GGCGCCCAGG 
CAGAAGGAAG 
TTACATCGAC 
AGA6GGTGGA 
CCTGAAGCTC 
T6AGATTCTC 
CAAA6GGGAC 
GGACTCCCTG 
CAAAGTCTGG 
AGGAGAAAGC 
GAAATTCCAC 
TGCGTCCCTG 
CTCGGAGGAT 
GCTGAGTGCC 
JGTGCCGCGCT 
TGCACCCGCC 
GCTCTCAGAT 
TCTGATCCTA 
CAGAGCGCAA 
GGTCTCAGAT 
GAATTCCTCT 
CGCTTCCCCC 
AGTCAAGATG 
GAGGTTGCCT 
CA6TGGGGCC 
TGCCATCGTT 
AGCAG6ATTT 
TCGCCCCTCT 
TGGAGGCTCA 
A6CAAGTCGC 
CTCTCGCATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
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15 



20 
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35 



CACTTCAGGG 

CTCATGCGTG 

GCTGGGGGGA 

AAATCAGAAT 

TGCTCTATCT 

AGAATCTGTC 

TTCTGTGTGC 

GCGTGGTAGG 

GGGTGCTGCG 

GGGCGGGGGA 

CACACTGTAG 

TTGCTCTTAG 

TAGGGTTCGT 

GTAGGTAGGG 

GGCTAGTAGG 

GGTAGGGCTA 

TAGTAGGTAG 

A6GGCTAGTA 

TAGGTAGGGT 

GTTAGTAGGT 

GTA6G6CTGG 

A6TAG6TAG6 

GGGTTCGTAG 

TGCTTCCACC 

CTCTTTCTCT 

TCACTCAACA 

CCCTGCCCCC 

TTTATACCCA 

TTACAGGGTT 

GCTATGGTTT 

TAAAACGCTG 

TATGTCTTAA 

TTTGTAAACA 

ATTGTTATCA 



TG6CGTGTGG 

TGTGTGTGTG 

CGGGGTGAGT 

ATGGGATTTG 

GGTACAGGCC 

CAGAAGTTGC 

AGCAGAGGCC 

CATGGAGATC 

TGACTGGAGA 

GGGACCGAGC 

ACGTCCCAGG 

AGATCGAGCT 

AGGTAGGGCT 

TTAGTAGGTA 

TAGGGTTCGT 

GTAGGTAGGG 

GGCTAGTAGG 

GGTAGGGCTA 

TAGTAGGTAG 

AGGGCTAGTA 

TAGGTAGGGT 

GCTAGTAGGT 

GTAGGGTTCG 

TGGTGCTTCC 

TTCTCTGTGT 

GTCCTCATGT 

TCCCAGGCTG 

AAGACTGTAG 

TCCTCCCGAG 

GAGTATGCAG 

CTGTCATTTC 

TTCACTTTCC 

TATTACCTCA 

ATAATAAATG 



CATGTAGGAG 

CATGTGCTGT 

GGAAACTTAG 

TTTGCCTTTT 

CTTATTTTTT 

ATAGGGGATG 

GTGTTTTTCA 

CTGGTTGTGC 

GCTGTGTGGA 

AGCCCTCTTG 

GCCTGTGCTG 

CCTCAGTGGT 

AGTAGGTAGG 

GGGTTCGTAG 

AGGTAGGGCT 

TTAGTAGGTA 

TAGGGTTCGT 

GTAGGTAGGG 

GGCTAGTAGG 

GGTAGGGCTA 

TAGTAGGTAG 

AGGGCTAGTA 

TAGGTAGGGT 

TGTTCCCAAA 

CTCAGATGGC 

GCCCAGAGAT 

AAGATCTGTT 

TGCATCTTGA 

TAATCCAATC 

TTTGCATCGT 

CCATTTCTTA 

TTCCTT^AATT 

CTTGGTAATA 

TGAACTATTT 



TCCTGCTTCT 

GTGTGTGCAT 

TTTGAGTAAT 

ACATTTTGTT 

CAGCTTTTTA 

GCCTCCACGA 

TGCCAAACCC 

CGTCTCAGCT 

GGCCATGTGT 

CCCGGTCGGG 

TGATCACCTG 

ACCTGAAGCC 

GTTAGTAGGT 

GTAGGGCTGG 

AGTAGGTAGG 

GGGTTCGTAG 

AGGTAGGGCT 

TTAGTAGGTA 

TAGGGCTAGT 

GTAGGTAGGG 

GGCTAGTAGG 

GGTAGGGCTA 

TCGTAGGTAG 

TCACAAGGGC 

GATTTTGCTG 

GTTTATAGAA 

CTTTTTAAGT 

AGAGCTCAAA 

TCACTCCCCT 

GTTTCTACCT 

GTACTAATGA 

TGTTATTTGC 

CRATACTGAT 

AAAG 



TTGTACATGG 
GTGTGCATGA 
GAAGGAATCT 
TAATTCCTGA 
TGGGAAAAGC 
TAAGGACATG 
CACGCGGCTG 
CCGCTCTGAA 
GCCCCGTGCA 
TCAGCCCTAG 
CCTTTGGACC 
TTTGCTTCCG 
AGGGCTAGTA 
TAGGTAGGGT 
GTTAGTAGGT 
GTAGGGCTGG 
AGTAGGTAGG 
GGGTTCGTAG 
AGGTAGGGCT 
TTAGTAGGTA 
TAGGGCTAGT 
GTAGGTAGGG 
GGTTAGTAGC 
CTGAAGGTGG 
ACAGCTGCCA 
CTGTTTGAAT 
TGATTCGGGA 
GCACATGACC 
TGTAAGGGAA 
TTAGTACCTT 
TTCTTTGATT 
ATATCAAATT 
AGTCTTTAAA 



GAATTGTGGA 

CGGTGGGGGT 

TCACAGAAGC 

TTTTAAAGCC 

AGGTTATTTG 

CAACACGTGT 

TCAACTGTGT 

GGCACTGTGT 

GGGATCAGGA 

TGGCTGCCTG 

ACATTTGTGT 

GAAAGCGCGG 

GGTAGGGCTA 

TAGTAGGTAG 

AGGGCTAGTA 

TAGGTAGGGT 

GTTAGTAGGT 

GTAGGGCTGG 

AGTAGGTAGG 

GGGTTCGTAG 

AGGTAGGGCT 

CTAGTAGGTA 

GCGTCTGTGC 

TCCCTGCTTT 

AGAAAATGCT 

TGCAGCCATC 

GTGGCATTCT 

GCACAAATGC 

TTCTGGGGCA 

GCCACTCTTT 

CTCCCTCTAT 

CTGTAAATGT 

AGATTTTTTT 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 
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5220 

5280 

5340 

5400 



Seq ID No: 137 Protein secruencei 
Protieln Accession #: NP 002994.1 
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MVQKYQSPVR 
IiDVDAPRLLK 
EDWTCFEQSA 
WSPPSITPSS 
ADHIKRYLGD 
SLTWRKQHQV 
EALIiRYVLSV 
VEANYPETLG 
EIIPDFIiSGE 
QIVDASSVIT 
IjGRDYSHVES 
VSSHKCKVMY 



11 
I 

VYKYPFELIM 
KIAGVDYVYF 
SLDIKSFFGF 
ETSSSSSKKQ 
LTPLQESCLI 
DYIIiETWTPP 
NEERIiRRCEE 
RLLILRAPRV 
CMCEVPEGGL 
WDFDVCKGDI 
PIjICKEGESV 
YTEVIGSEDF 



21 

I 

AAYERRFPTC 
VQKNSIiNSRE 
ESTVEKIAMK 
AASMAWIPE 
RIiRQWIiQETH 
QVLQDYYAGG 
NTKVFGRPIS 
FPVIiWTIiVSP 
VPKSIiYRTAE 
VFNIYHSKRS 
QGSHVTRWPG 
RGSMTSIiESS 



31 
I 

PLIPMFVGSD 
RTLHIEAYNE 
QYTSNIKKGK 
AALKEGLSGD 
KGKIPKDEHI 
WHHHDKDGRP 
SWTCLVDUEG 
FIDDNTRRKF 
ELENEDIiKXiW 
PQPPKKDSL6 
FYIIiQWKFHS 
HS6FSQLSAA 



41 

I 

TVSEFKSEDG 
TFSNRVIINE 
EIIEYYLRQL 
ALSSPSAPEP 
LRFIjRARDFN 
IiYVIiRLGQMD 
LNMRHLmPG 
LIYAGNDYQG 
TETIYQSASV 
AH8ZTSPGGN 
MPACAASSLP 
TTSSSQSHSS 



51 
I 

AIHVTERRCK 
HCCYTVHPEN 
EEEGITFVPR 
WGTPDDKLD 
IDKAREIMCQ 
TKGLVRAIiGB 
VKAIiIjRIIEV 
PGGIiLDYIDK 
FKGAPHEZLI 
NVQLZDKVWQ 
RVDDVLASIKJ 
SMZSR 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 



seq ID NO: 138 DKA secmence 

Nucleic Acid Accession #: NM_004181.1 

Coding sequence: 32-670 (iinderlined sequences correspond to start and stop codons) 



60 



65 



70 



75 



1 
I 

GCAGAAATAG 
GGTCGCCGGC 
GGTGCCAGCG 
CTTCAGGAAA 
CATGAAGCAG 
TAATCAAGAC 
AGAGAAAATG 
AGCCCATGAT 
TTTTATTCTG 
TCCGGTGAAC 
CAGAGAATTC 
GGCAGC CTAA 
AATATATACC 
TGTTCTGCAG 
ACAGCTGTCC 



11 

I 

CCTAGGGAGA 
CAGTGGCGCT 
CCTGCCTGCG 
AAGCAGATTG 
ACCATTGGGA 
AAACTGGGAT 
TCCCCTGAAG 
GCCGTGGCAC 
TTTAACAACG 
CATGGCGCCA 
ACCGA6CGTG 
TGCTCTGTGG 
CCCCATGCAG 
ACACGCCTTC 
ACTGGGCCAT 



21 
I 

TCAACCCCGA 
TCGTGGACGT 
CGCTGCTGCT 
AAGAGCTGAA 
ATTCCTGTGG 
TTGAGGATGG 
ACAGAGCAAA 
AGGAA66CCA 
TGGATGGCCA 
GTTCAGAGGA 
AGCAAGGAGA 
GAGGGACTTT 
TCTAAAATGC 
CCCTCA6CCA 
TGTGGTGTGA 



31 

1 

GATGCTGAAC 
GCTGGGGCTG 
GCTGTTTCCC 
GGGACT^GAA 
CACAATCGGA 
ATCAGTTCTG 
ATGCTTTGAA 
ATGTCGGGTA 
CCTCTATGAA 
CACCCTGCTG 
AGTCCGCTTC 
GCTGATTTCC 
TTCAGTACTT 
CACCCAGGCA 
GCTTCAGATQ 



41 
I 

AAAGTGCTGT 
GAAGAGGA6T 
CTCACGGCCC 
GTTAGTCCTA 
CTTATTCACG 
AAACAGTTTC 
AAGAATGA66 
GATGACAAGG 
CTTGATGGAC 
AAGGACGCTG 
TCTGCCGTGG 
CCTCTTCCCT 
GTGAAACACA 
CTTAAGCACA 
GTGAAGCATT 



51 
I 

CCCGGCTGGG 
CTCTGGGCTC 
AGCATGAGAA 
AAGTGTACTT 
CAGTGGCCAA 
TTTCTGAAAC 
CCATACAGGC 
TGAATTTCCA 
GAATGCCTTT 
CCAAGGTGT6 
CTCTCTGCAA 
TCAACATGAA 
GCTGTTCTTC 
AGCAGAGTGC 
CTCCCCRGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



245 



wo 02/079492 



PCT/US02/(M9I5 



TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TA6C 



960 



Seq ID No: 139 Protein sequence! 
Protein Accession #: NP_004172.1 



10 



15 



20 



] 11 21 31 41 SI 

I I i I I I 

MLNKVLSRLG VAGQWRFVDV LGLEEBSLGS VPAPACAIiLI, LFPLTAQHEa^ FRKKQIEELK 
GQEVSPFCVYP MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQPLSET EKMSPEDRAK 
CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYBLDGRMPF PVNHGASSED 
TLLKDAAKVC REFTEREQGE VRPSAVALCK AA 



60 
120 
160 



Seq ID NO: 140 DNA semience 

Nucleic Acid Accession #: NM_000201.1 

Coding sequence: 58-1656 (underlined sequences correspond to start and stop codons) 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



I 

GCGCCCCAGT 
GCTCCCAGCA 
CCAGGACCTG 
GGCTCCGTGC 
ACCCCGTTGC 
AGCAATGTGC 
ACAGCTAAAA 
CCCTCTTGGC 
CCCCGGGCCA 
GCTGTGGGGG 
GCCAATTTCT 
AACACCTCGG 
GTCAGCCCCC 
CTGTTCCCAG 
ACAGTCACCT 
GACGAGGGCA 
CTGCAGACAG 
GTCTCAGAAG 
CTGAATGGGG 
CCAGAGGACA 
ATACACAAGA 
GATTGTCCGG 
TGGGGGAACC 
GGGGAATCAG 
ACTCAAGGGG 
GTCATCATCA 
CTCTATAACC 
CCCATGAAAC 
CCTCGGCCTT 
CATGCAGCTA 
TACAACAGCA 
CTGTAGTCAC 
TTAAAGTCTA 
ATACAACTGG 
CAGAAGAAGT 
GACX3GATGCC 
TTCATTTGTT 
GGTCTCTGGC 
TACAGGTTGT 
CATTGGCCAA 
ATGGACTGGT 
CCCCCAAAAC 
CACAATGACA 
GCCTTGTCCT 
TCCTGCAGTG 
CCTCCCAGCT 
CGCTCTGTCA 
TTTGGGCTCA 
ACACCACACC 
TGCCCAGACT 



11 



21 



CGACGCTGAG 
GCCCCCGGCC 
GCAATGCCCA 
TGGTGACATG 
CTAAAAAGGA 
AAGAAGATAG 
CCTTCCTCAC 
AGCCAGTGGG 
ACCTCACCGT 
AGCCCGCTGA 
CGTGCCGCAC 
CCCCCTACCA 
GGGTCCTAGA 
TCTCGGAGGC 
ATGGCAACGA 
CCCAGCGGCT 
TGACCATCTA 
GGACCGAGGT 
TTCCAGCCCA 
ACGGGCGCAG 
ACCAGACCCG 
GAAACXGGAC 
CATTGCCCX3A 
TGACTGTCAC 
AGGTCACCCG 
CTGTGGTAGC 
GCCAGCGGAA 
CGAACACACA 
CCCATATTGG 
CACCTACCGG 
TTTGGGGCCA 
ATGACTAAGC 
GCCTGATGAG 
GAAATACTGA 
GGCCCTCCAT 
AGCTTGGGCA 
ATTTTACCAG 
CTCACGGAGC 
ACACTGCAGG 
CCTGCCTTTC 
AATGGTTCAC 
TGACACCTTT 
CTCAGCGGTC 
CTTGTCCTGT 
ATCAGGGTCC 
TTGGAAGGGT 
CCCAGGCTGG 
AGTGATCCTC 
TGGCAAATTT 
TCCTTTGTGT 



CTCCTCTGCT 
CGCGCTGCCC 
GACATCTGTG 
CAGCACCTCC 
GTTGCTCCTG 
CCAACCAATG 
CGTGTACTGG 
CAAGAACCTT 
GGTGCTGCTC 
GGTCAOGACC 
TGAACTGGAC 
GCTCCAGACC 
GGTGGACACG 
CCAGGTCCAC 
CTCCTTCTCG 
GACGTGTGCA 
CAGCTTTCCG 
GACAGTGAAG 
GCCACTGGGC 
CTTCTCCTGC 
GGAGCTTCGT 
GTGGCCAGAA 
GCTCAAGTGT 
TCGAGATCTT 
CGAGGTGACC 
AGCCGCAGTC 
GATCAAGAAA 
AGCCACGCCT 
TGGCAGTGGT 
CCCTGGGACG 
TGGTACCTGC 
CAAGAGGAAG 
AGGGGAAGTG 
AACTTGCTGC 
AGACATGTGT 
CTGCTGTCTA 
CTATTTATTG 
TCCCAGTCCA 
AGAGTGCCTG 
CCCAGAAGGA 
AG6TTCAGAG 
GTTAGCCACC 
ATGTCTGGAC 
TTGCATTTCA 
TGCAAGCAGT 
CATCC6C6T6 
AGTGCAGTGG 
CCACCTCAGC 
GATTTTTTTT 
TAGTTAATAA 



31 
I 

ACTCAGAGTT 
GCACTCCTGG 
TCCCCCTCAA 
TGTGACCAGC 
CCTGGGAACA 
TGCTATTCAA 
ACTCCAGAAC 
ACCCTACGCT 
CGT6GGGAGA 
ACGGTGCTGG 
CTGCGGCCCC 
TTTGTCCTGC 
CAGGGGACCG 
CTGGCACTGG 
GCCAAGGCCT 
GTAATACTGG 
GCGCCCAACG 
TGTGAGGCCC 
CCGAGGGCCC 
TCTGCAACCC 
GTCCTGTATG 
AATTCCCAGC 
CTAAAGGATG 
GA6GGCACCT 
GTGAATGTGC 
ATAATGGGCA 
TACAGACTAC 
CCCTGAACCT 
6CCACACTGA 
CCGGAGGACA 
ACAGCTAAAA 
GAGCAAGACT 
GTGGGGGAGA 
CTATTGGGTA 
AGCATCAAAA 
CTGACCCCAA 
AGTGTCTTTT 
TGTCACATTC 
GCAAAAAGAT 
GTGATTTTTC 
ATTACCCAGT 
TCCCCACCCA 
ATGAGTGCCC 
CTGGGAGCTT 
6GGGAAGGGG 
T6TGTGTGTG 
TGCAATCATG 
CTCCTGAGTA 
TTTTTTTTCA 
A6CTTTCTCA 



41 
I 

GCAACCTCAG 
TCCTGCTCGG 
AAGTCATCCT 
CCAAGTTGTT 
ACCGGAAGGT 
ACTGCCCTGA 
GGGTGGAACT 
GCCAGGTG6A 
AGGAGCTGAA 
TGAGGAGAGA 
AAGGGCTGGA 
CAGCGACTCC 
TGGTCTGTTC 
GGGACCAGAG 
CAGTCAGTGT 
GGAACCAGAG 
TGATTCTGAC 
ACCCTAGAGC 
AGCTCCTGCT 
TGGAGGTGGC 
GCCCCCGACT 
AGACTCCAAT 
GCACTTTCCC 
ACCTCTGTCG 
TCTCCCCCCG 
CT6CAGGCCT 
AACAGGCCCA 
ATCCCGGGAC 
ACAGAGTG6A 
GGGCATTGTC 
CACTAGGCCA 
CAAGACATGA 
CATAGCCCCA 
TGCTGAGGCC 
CACAAAGGCC 
CCCTTGATGA 
ATGTAGGCTA 
AAGGTCACCA 
CAAATGGGGC 
TATCGGCACA 
GAGGCCTTAT 
CATACATTTC 
AGGGAATATG 
GCACTATTGC 
GCCAAGGTAT 
TGTATGTGTA 
GTTCACT6CA 
GCTGGGACCA 
GAGACGGGGT 
ACTGCC 



51 
I 

CCTCGCT ATG 
GGCTCTGTTC 
GCCCCGGGGA 
GGGCATAGAG 
GTATGAACTG 
TGGGCAGTCA 
GGCACCCCTC 
GGGTGGGGCA 
ACGG6AGCCA 
TCACCATGGA 
GCTGTTTGAG 
CCCACAACTT 
CCTGGACGGG 
GTTGAACCCC 
GACCGCAGAG 
CCAGGAGACA 
GAAGCCAGAG 
CAAGGTGACG 
GAAGGCCACC 
CGGCCAGCTT 
GGACGAGAGG 
GTGCCAGGCT 
ACTGCCCATC 
GGCCAGGAGC 
GTATGAGATT 
CAGCACGTAC 
AAAAGGGACC 
AGGGCCTCTT 
A6ACATATGC 
CTCAGTCAQA 
CGCATCTGAT 
TTGATGGATG 
CCATGAGGAC 
CACAGACTTA 
CACACTTCCT 
TATGTATTTA 
AATGAACATA 
GGTACAGTTG 
TGGGACTTCT 
AAAGCACTAT 
TCCTCCCTTC 
TGCCAGTGTT 
CCCAAGCTAT 
A6CTCCAGTT 
T6GAGGACTC 
GACAAGCTCT 
GTCTTGACCT 
TAGGCTCACA 
CTCGCAACAT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



Seq ID No: 141 Protein sequence; 
Protein Accession #: NP_000192.a 



246 



NSDCX;iD: <WO. 



.02079492A2.I_> 



wo 02/079492 



PCT/US02/04915 



11 



21 



31 



41 



SI 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



MLiQFVRAGAR 
PVQAWVESLR 
RGGGGKPLAA 
KLAQDDIiHIM 
NLIPAVGLNV 
TQGPAATPYH 



AWLRPTGSQG 
GFEQERVGLA 
ERHWAGPAWQ 
DSLELPTGDP 
HSMLKHQTIiV 
C 



liSSLAEEAAR 
DIoHPDVFATA 
HPLSALARRR 
QYLTELAHYR 
LTIjPTVAFIiE 



ATENPEQVAS 
PRLDIIiHQVA 
CCPWPPGPTS 
RWGDSVIjLVD 
DKLIiWQDSRY 



EGLPEPVIiRK 
MWQKNFKRIS 
YYYMLPMKVR 
LTHEEMPQSI 
RPLYPFSLPY 



VELPVPTHRR 6 0 

YAKTKTRAEV 120 

AliGLKVALTV 180 

VEATSRLKTF 240 

SDF PRPIiPHA 300 



Seq ID NO: 142 DNA sequence 

Nucleic Acid Accession #: NM_000270.1 

Coding sequence: 110 - 979 {underlined sequences correspond to start and stop codons) 



1 
I 

AACTGTGCGA 
GGATCGGAGC 
ATACACCTAT 
ACCTCAAGTT 
GGCCCAGATC 
TGCTGGCCGA 
GTTCCACATG 
CCTTCTGGGT 
TGAGGTTGGA 
GAACCCTCTC 
TGCCTACGAC 
ACGTGAGCTA 
AGAATGTCGT 
AGTTATCGTT 
GGTCATCATG 
CAAACAAGCT 
CCCTGACAAA 
GTAGCTGCTA 
CA6AAAG6AA 
TGCCAGATCC 
ACAAAATAAA 
ACCACACATC 
TGCTACTAGC 
CCAGAGACCA 



11 

I 

ACCAGACCCG 
ACACCGGAGC 
GAAGATTATA 
6CAATAATCT 
TTTGACTACA 
CTGGTGTTTG 
TATGAAGGGT 
GTGGACACCC 
GATATCATGC 
AGAGGGCCCA 
CGGACTATGA 
CAGGAAGGCA 
GTGCTGCAGA 
GCACGGCACT 
GATTATGAAA 
GCACAGAAAT 
GCCAGTTGAC 
CCTTCTTTGG 
AAGATTCCTG 
TCTTCTCAAA 
GCTGTTCTCA 
TGTGGAGATG 
TCTTTGAGAT 
AACAAGGACT 



21 
I 

GCAGCCTTGC 
AGGCTCATCG 
AGAACACTGC 
GTGGTTCTGG 
GTGAAATCCC 
GGTTCCTGAA 
ACCCACTCTG 
TGGTAGTCAC 
TGATCCGTGA 
ATGATGAAAG 
GGCAGAGGGC 
CCTATGTGAT 
AGCTGGGAGC 
GTGGACTTCG 
GCCTGGAGAA 
TGGAACAGTT 
CTGCCTTGGA 
CCCCTTGCTG 
TCCTTCACCT 
GCTGGGATTA 
TTCCTGTTCT 
CCCAGGATTT 
AATACATTCC 
AATCCAATAC 



31 
I 

TCAGTTCAGC 
AGAAGGCGTC 
AGAATGGCTT 
ATTAGGAGGT 
CAACTTTCCT 
TGGCAGGGCC 
GT^GGTGACA 
CAATGCAGCA 
CCATATCAAC 
GTTTGGAGAT 
TCTCAGTACC 
GGTGGCAGGC 
AGACGCTGTT 
AGTCTTTGGC 
GGCCAACCAT 
TGTCTCCATT 
GTCGTCTGGC 
GA6TCAT6TG 
TTCCCACTTT 
CAGGTGTGAG 
TTCTTACACA 
GACTCGGGCC 
GAGGGGCTCA 
CTCTTGGA 



41 
I 

ATAGCGGAGC 

TGCGAGACCA. 

CTGTCTCATA 

CTGACTGATA 

CGAAGTACAG 

TGTGTGATGA 

TTCCCAGTGA 

GGAGGGCTGA 

CTACCTGGTT 

CGTTTCCCTG 

TGGAAACAAA 

CCCAGCTTTG 

GGCATGAGTA 

TTCTCACTCA 

GAAGAAGTCT 

CTTATGGCCA 

ATCTCCCACA 

CCTCTGTCCT 

CTTCTACCAG 

CATAGTGAGA 

AGAGCTGGAG 

TTAGAACTTT 

GTTCTGCCTT 



51 
I 

GGATCCGATC 

TGGAGAACGG 

CTAAGCACCG 

AATTAACTCA 

TGCCAGGTCA 

TGCAGGGCAG 

GGGTTTTCCA 

ACCCCAAGTT 

TCAGTGGTCA 

CCATGTCTGA 

TGGGGGAGCA 

AGACTGTGGC 

CAGTACCAGA 

TCACTAACAA 

TAGCAGCTGG 

GCATTCCACT 

CAAGACCCAA 

TAGGTTGTAG 

ACCCTTCTGG 

CCTTGGCGCT 

CCCGTGCCCT 

GCATAGCAGC 

ATCTAAATCA 



seq ID No: 143 Protein sequence; 
Protein Accession #: NP_000261.1 



1 

I 

MENGYTYEDY 
VPGHAGRLVF 
NPKFEVGDIM 
MGEQREXjQEG 
ITNKVIMDYE 



11 

I 

KNTAEWIiLSH 
GFLNGRACVM 
LIRDHXNLPG 
TYVMVAGPSF 
SliEKANHEEV 



21 



31 



41 



51 



TKHRPQVAII CGSGLGGLTD KLTQAQIPDY SBIPNFPRST 
MQGRFHMYEG YPLWKVTFPV RVFHIiLGVDT IiWTNAAGGIi 
FSGQNPIiRGP NDERFGDRFP AMSDAYDRTM RQRAIiSTWKQ 
ETVAECRVLQ KLGADAVGHS TVPEVIVARH CXSIiRVFGFSIi 
liAAGKQAAQK IiEQFVSILMA SIPIiPDKAS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 



Seq ID NO: 144 DNA sequence 

Nucleic Acid Accession #: NM_015577.1 

Coding sequence: 112-3054 (underlined sequences correspond to start and stop codons) 



60 1 11 21 31 41 51 

GPAGCGGCGG GCGGGGTGGA GCAGCCAGCT GGGTCCGGGG AGCGCCGCCG CCGCCTCGAT 60 

GGGGTGTTGA AAAGTCTCCT CTAGAGCTTT GGAAGGCTGA ATGCACTAAA CATGAAGAGC 120 

TTGAAAGCGA AGTTCAGGAA GAGTGACACC AATGAGTGGA ACAAGAATGA TGACCGGCTA 180 

65 CTGCAGGCCG TGGAGAATGG AGATGCGGAG AAGGTGGCCT CACTGCTCGG CAAGAAGGGG 240 

GCCAGTGCCA CCAAACACGA CAGTGAGGGC AAGACCGCTT TCCATCTTGC TGCTGCAAAA 300 

GGACACGTGG AATGCCTCAG GGTCATGATT ACACATGGTG TGGATGTGAC AGCCCAAGAT 360 

ACTACCGGAC ACAGCGCCTT ACATCTCGCA GCCAAGAACA GCC3^CCATGA ATGCA TCAGG 420 

AGGCTGCTTC AGTCTAAATG CCCAGCCGAA AGTQTCGACA GCTCTGGGAA AACAGCTTTA 480 

70 CATTATGCAG CGGCTCAGGG CTGCCTTCAA GCTGTGCAGA TTCTCTGCGA ACAC3UWSAGC 540 

CCCATAAACC TCAAAGATTT GGATGGGAAT ATACCGCTGC TTCTTGCTGT ACAAAATGGT 600 

CACAGTGAGA TCTGTCACTT TCTCCTGGAT CATGGA6CAG ATGTCaATTC CAGGAACAAA 660 

AGTGGAAGAA CT6CTCTCAT GCTGGCCTGT 6AGATTGGCA GCTCTAACGC TGTGGAAGCC 720 

TTAATTAAAA AGGGTGCAGA CCTAAACCTT GTAGATTCTC TTGGATACAA TGCCTTACAT 780 

75 TATTCCAAAC TCTCAGAAAA TCCAGGAATT CAAAGCCTTC TATTATCAAA AATCTCTCAG 840 

GATGCTGATT TAAAGACCCC AACAAAACCA AAQCAGCATG ACCAAGTCTC TAAAATAAGC 900 



247 



wo 02/079492 



PCT/tlS02/049l5 



10 



TCAGAAAGAA GTGGAACTCC AAAAACACGC AAAGCTCCAC CACCTCCTAT CAGTCCTACC 
CAGT TCAGTG ATGTCTCTTC CCCAAGATCA ATAACTTCGA CTCCACTATC GGGAAAGGAA 
TCGGTATTTT TTGCTGAACC ACCCTTCAAG GCTGAGATCA GTTCTATACG AGAAAACAAA 
GACAGACTAA GT6ACAGTAC TAGAGGTGCT GATAGCTTAT TGGATATAAG TTCTGAAGCT 
GACCAACAAG ATCTTCTCTC TCTATTGCAA GCAAAAGTTG CTTCCCTTAC CTTACACAAT 
AAGGAGTTAC AAGATAAATT ACAGGCCAAA TCACCCAAGG AGGCGGAAGC AGACCTAAGC 
TTTGACTCAT ACCATTCCAC CCAAACTGAC TTGGGCCCAT CCCTGGGAAA ACCTGGTGAA 
ACCTCTCCCC CAGACTCCAA ATCATCTCCA TCTGTCTTAA TACATTCTTT AGGTAAATCC 
ACTACTGACA ATGATGTCAG AATTCAGCAA CTGCAA6AGA TTTTGCAA6A TCTACAGAAG 
AGATTAGAGA GCTCTGAAGC AGA6A6AAAA CAGCTACAGG TCGAACTCCA ATCCC6AAGQ 
GCAGAACTGG TATGCTTAAA CAACACTGAG ATTTCAGAGA ACAGCTCTQA CCTCAGCCAQ 
AAACTTAAAG AAACTCAGAG CAAATACGAG GAGGCTATGA AAGAAGTCCT TAGTGT6CAG 
AAGCAGATGA AACTCGGTCT TGTCTCACCT GAAAGCATGG ATAATTATTC ACATTTCCAC 
GAGCTGAGGG TCACGGAAGA GGAAATAAAT GTGCTAAAGC AGGATCTGCA GAATGCATTA 
iD GAAGAAAGTG AAAGAAATAA AGAGAAAGTG AGAGAGTTAG AGGAAAAACT GGTAGAGAGG 
GAGAAAGGTA CAGTGATTAA GCCACCTGTG GAAGAGTACG AGGAAATGAA AAGTTCATAT 
TGCTCTGTTA TTGAGAATAT GAATAAGGAG AAAGCATTTT TGTTTGAGAA ATACCAAGAA 
GCCCAAGAAG AAATCATGAA ATTAAAAGAC ACACTAAAAA GTCAGATGAC ACAGGAAGCC 
AGTGATGAAG CTGAGGACAT GAAAGAAGCC ATGAATAGGA TGATAGATGA ACTCAATAAA 
CAGGTGAGCG AGCTGTCACA GCTGTACAAA GAAGCCCAGG CTGAGCTGGA GGATTACAGG 
AAGAGGAAAT CTCTAGAGGA TGTCACAGCT GAATATATCC ATAAAGCAGA GCATGAGAAA 
CTGATGCAAT TGACAAACGT GTCCAGGGCT AAAGCAGAAG ATGCACTGTC TGAAATGTVAG 
TCTCAGTATT CAAAAGTGTT GAATGAGTTG ACCCAGCTCA AACAACTGGT GGATGCACAA 
AAAGAGAACT CT6TCTCTAT CACAGAACAT TTGCAAGTGA TAACCACGCT GCGGACTGCA 
GCAAAAGAGA TGGAAGAAAA AATAAGCAAT CTTAAGGAAC ACCTTGCAAG CAAGGAAGTG 
GAAGTAGCAA AGCTGGAGAA ACAACTCTTA GAAGAGAAAG CTGCTATGAC TGATGCAATG 
GTACCTCGGT CTTCCTATGA AAAACTCCAG TCATCCTTAG AGAGTGAAGT GAGTGTGTTG 
GCATCGAAAT TAAAGGAATC T6TGAAAQAG AAAGAGAAGG TCCATTCAGA GGTTGTCCAG 
ATTAGAAGTG AGGTCTCACA GGTGAAAAGA GAAAAGGAAA ATATTCA6AC TCTCTTGAAA 
TCCAAAGAGC AAGAAGTAAA TGAACTTCTG CAAAAATTCC AGCAAGCTCA GGAAGAACTT 
GCAGAAATGA AAAGATACGC TGAGAGCTCT TCAAAACTGG AGGAA6ATAA A6ATAAAAAG 
ATAAATGAGA TGTCGAAGGA AGTCACCAAA TTGAAGGAGG CCTTGAACAG CCTCTCCCAG 
CTCTCCTACT CAACAAGCTC ATCCAAAAGG CAGAGTCAGC A6CTGGAGGC 6CTGCAGCAG 
CAAGTCAAAC AGCTCCAGAA CCAGCTGGCG GAATGCAAGA AACAACACCA GGAGGTCATA 
TCAGTTTACA GAATGCATCT TCTGTATGCT GTGCAGGGCC AGATGGATGA AGATGTCCAQ 
AAAGTACTGA AGCAAATCCT TACCATGTGT AAAAACCAGT 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



CTCAAAAGAA GTAAAGTGGA 

TTCCTTGGCA GGACACTGCC CCTTGTCATC TGTCTTTGTG TTAGATCCAG AGTTGTCGGC 
AGCCGCTGCC ATTGTTCTCA TTCGTGGTAT GCACTGTGGC CTAGCX3TAGC TTCTTCCCTT 



TCCAAAGGTT TCTGAGGACT 
ACTTCAAACC AGCAGAGGTG 



TCTCCCAGGA GAAGACTGCC CGCCTCAGAA CTGCTTAGAG 
AAAGTCCCTG TCATCCCTTC AGATTCCAGA GCTGGGATCA 
GCCATGCCCA GAGG TCTGGT CCTGATGCTG GCAGGGGGGC CCCCTCCTCC ATCCCTGACT 
GGCTGAGTGG CTTTATCACC ACCGAGTGAT GTGCTGAGGC CTCCTGCAGT GAATGCTCCT 
TCCATTCCTG TACTCGGGCA GTGCCATTCA GCACAGGAGA GCTCTTTTTG CCTTTGGCTT 
TCAATTCCAA AACATGATTT AATTTCTAAC TAAATTAGTA TGGCACTAGT TATGAAGTAT 
CTGCTTAAAA CCCTTCATCA TGATATCCTG TGGATTTAAA AACTCTAATT CCATGTTTTC 
TTCCCATCTG CCTTATATAT CTCATCACCC TGCTTATCAA TATTCAGTTT GATGAGCACT 
ATTAACTAAA ATATGAAACT TAAAAACAAA AGCAAGTTGT CCTTAAAAGT TCTTTTTTTA 
AGTAAATTGT TGACATACTG CAAATTTTCT ATGCAAACTT GCCTCCTGCT GTTATCTGTG 
AAGCTCAGGA AATCCAAACA TTTGTGTTTC AACAAGGGAC AGTAAACTGT GTGTTTACAG 
CCAAAAGAAA TGCCTCATAG TTCTTAACCT CAACTTTTGT AGAAGTATTT TTTTCTCTGT 
AATATTTTTA TTGGCTCATA AAGATGTTTT CATATCTGAA CTCCTAAATA AGTGAAATTA 
CAGTAGATTA TATTAACAAA ATACTTTTTA GGTAGCCATG CTTGAGACTT TTTAAAAATA 
TAACTTTTTC CTTAAAGTTT TCAGCTATAG CAAAAGGTAG TTATGTATGC CAGACCTAAT 
ATGAGCTGCC ACCAACACCC CTAGAACTTT CAGCCATGGT GTCTTCAGAA TTGTAGCGCA 
TTTCTGAATC TAGCAAATCC TCCTTTTACC CGTTGAATGT TTTGAATGCC CTGACTCTAC 
CAGCGCCCAT AAATGATCTC TAGAAGGACT GTTAGTACCA ATCTGTTTTT CAACTTTGAA 
GCTAAAAACC CTGATATGGT AATATTATGG TGCATAGCAG AGGTCTCGGA AAAAAAATAT 
TTCTGTTCAC TTTACTTTCA GGTTAAAAAT GTTTCTAACA CGCTTGCAAC TTCCCTTATG 
GCATTAATCT TGTTGAGGGA GAGAGACAGA ATCCTGGACT CTCCAAAGTA TTTAACTGAA 
AGTAGGGCCT GCTCTGACAG GGCCCATGTC CCACAAGGCT GCTTGGCCTC AGTGGGTGCT 
TGGCTGTGCT GGATGATATG TTGATCTGTA TTGGATAAGG ACCAATGACA GCAAAGCAAA 
AATGGCTTTA AAGCTTGGTG TTACTTTTCT TAAGTTGTTT AATTATAGTT AAGCAATTTC 
AAAAATGCTC CAAAGAAATG TGAAAGGACC TTTTGTCACA OCACTTCAGA AAATACACAA 
CAGCCCCTTC TGCCCCCGCA CAGAAATGCT GCAGAGTATA TAAAACTTGA GACATTTTTG 
TAGGATGCCT GACGAGGTGT AGCCTTTTAT CTTGTTTCCX3 GATOCATATT TATTAC6AGT 
ACTCTGGTTA AATATTGAAA AGTTATATGC TGTAGTTTTT AGTATTTTGT CTTTGTAATT 
TACAGAAGTT ATTGGAGAAA ATAAACTTGT TTCATTTTGC AAAAAAAAAA AAAAAAAAAA 
AAAAA 

Seq ID No: 145 Protein sequence; 
Protein Accession #: NP 056392.1 
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MKSLKAKFRK SDTNEWNKND DRLLQAVENO DAEKVASLLG KKGASATKHD SEGKTAFHIA 
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AAKGHVECLR VMITHGVDVT AQDTTGHSAL HIAAKNSHHE CIRRLLQSKC PAESVDSSGK 120 

TALHYAAAQG CLQAVQILCE HKSPINIiKDL DGNIPLLLAV QN6HSEICHF IiIiDHGADVNS 180 

RNKSGRTAIiM IiACEIGSSNA VEALIKKGAD LNIiVDSLGYN 7UJHYSKLSEN AGIQSLLLSK 24 0 

ISQDADLKTP TKPKQHDQVS KISSERSGTP KTRKAPPPPI SPTQLSDVSS PRSITSTPLiS 300 

GKESVFFAEP PFKAEISSIR ENKDRLSDST TGADSLLDIS SEADQQDLLS LLQAKVASLT 360 

LHNKBLQDKL QAKSPKEAEA DLSFDSYHST QTDLGPSLGK PGETSPPDSK SSPSVLIHSL 420 

GKSTTDNDVR IQQLQEILQD LQKRLESSEA ERKQLQVELQ SRRAEIjVCLN NTEISENSSD 480 

LiSQKXiKETQS KYEEAMKEVIj SVQKQMKLGL VSPESMDNYS HFHELRVTEE EINVIiKQDLQ 540 

NALEESBRNK EKVRELEEKIj VEREKGTVIK PPVEEYEEMK SS YCSVIENM NKEKAFIiPEK 600 

YQEAQEEIMK LKDTLKSQMT QEASDEAEDM KEAMNRMIDE LNKQVSELSQ LYKEAQAELE 660 

DYRKRKSLED VTAEYIHKAE HEKLMQLTNV SRAKAEDAIiS EMKSQYSKVIj NEliTQLKQLV 720 

DAQKENSVSI TEHIiQVITTL RTAAKEMEEK ISNLKEHIiAS KEVEVAKLEK QLLEEKAAMT 780 

DAMVPRSSYE KLQSSLESEV SVLASKLKES VKEKEKVHSE WQIRSEVSQ VKREKBNIQT 840 

liLKSKEQEVN ELLQKFQQAQ EELAEMKRYA ESSSKIiEEDK DKiaNEMSKE VTKIiKEALNS 900 

IiSQLSYSTSS SKRQSQQLEA LQQQVKQLQN QLAECKKQHQ EVISVYRMHli LYAVQGQMDE 960 
DVQKVIiKQIIj TMCKNQSQKK 

Seg ID NO: 146 DNA sequence 

Nucleic Acid Accession #: NM_000459»1 

Coding sequence: 149-3523 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 

GAAACTGGAT GGAGAGATTT GGGGAAGC:AT_GGACTCTTTA GCCAGCTTAG TTCTCTGTGG 180 

AGTCAGCTTG CTCCTTTCTG GAACTGTGGA AGGTGCCATG GACTTGATCT TGATCAATTC 240 

CCTACCTCTT GTATCTGATG CTGAAACATC TCTCACCTGC ATTGCCTCTG GGTGGCGCCC 300 

CCATGAGCCC ATCACCATAG G7AGGGACTT TGAAGCCTTA ATGAACCAGC ACCAGGATCC 360 

GCTGGAAGTT ACTCAAGATG TGACCAGAGA ATGGGCTAAA AAAGTTGTTT GGAAGAGAGA 420 

AAAGGCTAGT AAGATCAATG GTGCTTATTT CTGTGAAGGG CGTWm'CGAG GAGAGGCAAT 480 

CAGGATACGA ACCATGAAGA TGCGTCAACA AGCTTCCTTC CTACCAGCTA CTTTAACTAT 540 

GACTGTGGAC AAGGGAGATA ACGTGAACAT ATCTTTCAAA AAGGTATTGA TTAAAGAAGA 600 

AGATGCAGTG ATTTACAAAA ATGGTTCCTT CATCCATTCA GTGCCCCGGC ATGAAGTACC 660 

TGATATTCTA GAAGTACACC TGCCTCATGC TCAGCCCCA6 GATGCTGGAG TGTACTCGGC 720 

CAGGTATATA GGAGGAAACC TCTTCACCTC GGCCTTCACC AGGCTGATAG TCCGGAGATG 780 

TGAAGCCCAG AAGTGGGGAC CTGAATGCAA CCATCTCTGT ACTGCTTGTA TGAACAATGG 840 

T6TCTGCCAT GAAGATACTG GAGAATGCAT TTGCCCTCCT GGGTTTATGG GAAGGACGTG 900 

TGAGAAGGCT TGTGAACTGC ACACGTTTGG CAGTACTTGT AAAGAAAGGT GCAGTGGACA 960 

AGAGGGATGC AAGTCTTATG TGTTCTGTCT CCCTGACCCC TATGGGTGTT CCTGTGCCAC 1020 

AGGCTGGAAG GGTCTGCAGT GCAATGAAGC ATGCCACCCT GGTTTTTACG GGCCAGATTG 1080 

TAAGCTTAGG TGCAGCTGCA ACAATGGGGA GATGTGTGAT CGCTTCC7UVG GATGTCTCTG 1140 

CTCTCCAGGA TGGCAGGGGC TCCAGTGTGA GAGAGAAGGC ATACCGAGGA TGACCCCAAA 1200 

GATAGTGGAT TTGCCAGATC ATATAGAAGT AAACAGTGGT AAATTTAATC CCATTTGCAA 126 0 

AGCTTCTGGC TGGCCGCTAC CTACTAATGA AGAAATGACC CTGGTGAAGC CGGATGGGAC 1320 

AGTGCTCCAT CCAAAAGACT TTAACCATAC GGATCATTTC TCAGTAGCCA TATTCACCAT 1380 

CCACCGGATC CTCCCCCCTG ACTCAGGAGT TTGGGTCTGC AGTGTGAACA CAGTGGCTGG 1440 

GATGGTGGAA AAGCCCTTCA ACATTTCTGT TAAAGTTCTT CCAAAGCCCC TGAATGCCCC 1500 

AAACGTGATT GACACTGGAC ATAACTTTGC TGTCATCAAC ATCAGCTCTG AGCCTTACTT 1560 

TGGGGATGGA CCAATCAAAT CCAAGAAGCT TCTATACAAA CCCGTTAATC ACTATGAGGC 1620 

TTGGCAACAT ATTCAAGTGA CAAATGAGAT TGTTACACTC AACTATTTGG AACCTCXSGAC 1680 

A6AATATGAA CTCTGT6TGC AACTGGTCCG TCGTGGAGA6 GGTGGGGAAG GGCATCCTGG 1740 

ACCTGTGAGA CGCTTCACAA CAGCTTCTAT CGGACTCCCT CCTCCAAGAG GTCTAAATCT 1800 

CCTGCCTAAA AGTCAGACCA CTCTAAATTT GACCTGGCAA CCAATATTTC CAA6CTCGGA 1860 

AGATGACTTT TATGTTGTUVG TGGAGAGAAG GTCTGTGCAA AAAAGTGATC AGCAGAATAT 1920 

TAAAGTTCCA GGCAACTTGA CTTCGGTGCT ACTTAACAAC TTACATCCCA GGGAGCAGTA 1980 

CGTGGTCCGA GCTAGAGTCA ACACCAAGGC CCAGGGGGAA TGGAGTGAAG ATCTCACTGC 204 0 

TTGGACCCTT AGTGACATTC TTCCTCCTCA ACCAGAAAAC ATCAAGATTT CCAACATTAC 2100 

ACACTCCTCG GCTGTGATTT CTTGGACAAT ATTGGATGGC TATTCTATTT CTTCTATTAC 2160 

TATCCGTTAC AAGGTTCAAG GCAAGAATGA AGACCAGCAC GTTGATGTGA AGATAAAGAA 2220 

TGCCACCATC ATTCAGTATC AGCTCAAGGG CCTAGAGCCT GAAACAGCAT ACCAGGTGGA 2280 

CATTTTTGCA GAGAACAACA TAGGGTCAAG CAACCCAGCC TTTTCTCATG AACTGGTGAC 2340 

CCTCCCAGAA TCTCAAGCAC CAGCGGACCT CGGAGGGGGG AA GATG CTGC TTATAGCCAT 2400 

CCTTGGCTCT- GCTGGAATGA CCTGCCTGAC TGTGCTGTTG GCCTTTCTGA TCATATTGCA 2460 

ATTGAAGAGG GCAAATGTGC AAAGGAGAAT GGCCCAAGCC TTCCAAAACG TGAGGGAAGA 2520 

ACCAGCTGTG CAGTTCAACT CAGGGACTCT GGCCCTAAAC AGGAAGGTCA AAAACAACCC 2580 

AGATCCTACA ATTTATCCAG TGCTTGACTG GAATGACATC AAATTTCAAG ATGTGATTGG 2640 

GGAGGGCAAT TTTGGCCAAG TTCTTAAGGC GCGCATCAA6 AAG6ATGGGT T ACGGA TGGA 2700 

TGCTGCCATC AAAA6AATGA AAGAATATGC CTCCAAAGAT GATCACAGGG ACTTTGCAGG 2760 

AGAACTGGAA GTTCTTTGTA AACTTGGACA CCATCCAAAC ATCATCAATC TCTTAGGAGC 2820 

ATGT6AACAT CXSAGGCTACT TGTACCTGGC CATTGA6TAC GCGCCCCATG GAAACCTTCT 2880 

GGACTTCCTT CGCAAQAGCC GTGTGCTGQA GACGGACCCA GCATTTGCCA TTGCCAATAG 2940 

CACCGCGTCC ACACTGTCCT CCCAGCAGCT CCTTCACTTC GCTGCCGACG TGGCCCGGGG 3000 

CATGGACTAC TTGAGCCAAA AACAGTTTAT CCACAGGGAT CTGGCTGCCA GAAACATTTT 3060 

AGTTGGT6AA AACTATGTGG CAAAAATAGC AQATTTTGGA TTGTCCCGAG GTCAAGAGGT 3120 
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10 



15 



20 



GTACGTGAAA 
TTACAGTGTG 
TGTTAGCTTA 
GCCCCAGGGC 
GAGACAATGC 
CTTAAACAGA 
TACTTATGCA 
CCCTCTGTTT 
CTGCCAAAGG 
TAATATTTAA 
TCCCTCACCT 
TTTCCACAGC 
TTGCTTACAA 
ATTTTCTTTT 
TGGGTGACAT 
GCACATTGTA 
CACTTTGCAC 



AAGACAATGG 
TACACAACCA 
GGAGGCACAC 
TACAGACTGG 
TGGCGGGAGA 
ATGTTAGAGG 
GGAATTGACT 
CCCTTTCACT 
ATGTGATATA 
GACACT6AAA 
GTAGCATGCC 
CTGCAAGTTC 
6CCTAAGAAT 
CTTTTCTCTG 
TTGG6AGACA 
AAAAGTTTTA 
TGATATATCA 



GAAGGCTCCC 
ACAGTGATGT 
CCTACTGCGG 
AGAAGCCCCT 
AGCCTTATGA 
AGCGAAAGAC 
GTTCTGCTGA 
GGCATGGGAG 
TAAGTGTACA 
AATCTAAGTG 
AGTCCCGTTT 
AGTC CAGGAT 
CTTTAGAGAA 
GTAATATTGA 
TGTGACATTT 
GTTTTGATGA 
TGAGTGAATA 



AGTGCGCTGG 
ATGGTCCTAT 
GATGACTTGT 
GAACTGTGAT 
GAGGCCATCA 
CTACGTGAAT 
AGAAGCGGCC 
ACCCTTGACA 
TATGTGCTGG 
ATATAAATCA 
CATTTAGTCA 
QCTAACATCT 
GTATACATAA 
CTTGTATATT 
ATATATTGAA 
GTTGTGAGTT 
AATGTCTTGC 



ATGGCCATCG 
GGTGTGTTAC 
GCAGAACTCT 
GATGAGGTGT 
TTTGCCCAGA 
ACCACGCTTT 
TAGGACAGAA 
ACTGCTGAGA 
AATTCTAACA 
GATTCTTCTC 
TGTGACCACT 
AAAAATAGAC 
GTTTAGGATA 
TTAAGAAATA 
TTAATATCCC 
TACCTTGTAT 
CTACTCAAAA 



AGTCACTGAA 
TATGGGAGAT 
ACGAGAAGCT 
ATGATCTAAT 
TATTGGTGTC 
ATGAGAAGTT 
CATCTGTATA 
AAACATGCCT 
AGTCATAGGT 
TCTCATTTTA 
CTGTCTTGTG 
TTAAATCTCA 
AAATAATGGG 
ACAGAAAGCC 
TACATGTATT 
ACTGTAGGCA 
AAAAAAAA 



3180 
3240 
3300 
3360 
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Seq ID No; 14 7 Protein sequence: 
Protein Accession #: NP 000450.1 
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1 

I 

MDSLASLVLC 
FEALMNQHQD 
QASPLPATLT 
AQPQDAGVYS 
ICPPGPMGRT 
ACHPGFYGPD 
VNSGKFNPIC 
VWVCSVNTVA 
DLYKPVNHYE 
IGIiPPPRGLN 
LLNNIiHPREQ 
ILDGYSISSI 
SNPAFSHEIiV 
MAQAFQNVRE 
ARIKKDGLRM 
AIEYAPHGNL 
IHRDIiAARNI 
VWSYGVLL5^E 
ERPSFAQILV 



11 
I 

GVSLLliSGTV 
PLEVTQDVTR 
mVDKCDNVN 
ARYIGGNLFT 
CEKACELHTF 
CKLRCSCNNG 
KASGWPIiPTM 
GMVEKPFNIS 
AWQHIQVTNE 
LLPKSQTTUI 
YWRARVNTK 
TIRYICVQGKN 
TLPBSQAPAD 
EPAVQFNSGT 
DAAIKRMKEY 
LDFLRKSRVL 
LVGENYVAKI 
IVSIiGGTPYC 
SLNRMLEERK 



21 
I 

EGAMDLILIN 
EWAKKWWKR 
ISFKKVLIKE 
SAPTRIiIVRR 
GRTCKERCSG 
EMCDRFQGCL 
EEMTLVKPDG 
VKVLPKPLNA 
IVTLNYLEPR 
LTWQPIFPSS 
AQGBWSEOLT 
EDQHVDVKIK 
LGGGKMIjLXA 
liALNRKVKNN 
ASKDDHRDFA 
ETDPAFAIAN 
ADFGLSRGQE 
GMTCABLYEK 
TYVNTTLYEK 



31 

I 

SliPIiVSDAET 
EKASKINGAY 
EDAVIYKNGS 
CEAQKWGPEC 
QEGCKSYVFC 
CSPGWQGIiQC 
TVLHPKDFNH 
PNVIDTGHNP 
TEYEIiCVQIjV 
EDDFYVEVER 
AWTLSDZLPP 
NATIIQYQLK 
IIiGSAGMTCL 
PDPTIYPVLD 
GEIiEVLCKIiG 
STASTLSSQQ 
VYVKKTMGRL 
liPQGYRLEKP 
FTYAGIDCSA 



41 

1 

SLTCIASGWR 
FCEGRVRGEA 
FIHSVPRHBV 
NHLCTACMNN 
LPDPYGCSCA 
EREGIPRMTP 
TDHFSVAIFT 
AVINISSEPY 
RR6E6GEGKP 
RSVQKSDQQN 
QPENIKISNI 
GLEPBTAYQV 
TVIjLAFLIIL 
WNDIKFQDVI 
HHPNIINLLG 
LLHFAADVAR 
PVRWMAIESL 
LNCDDEVYDZi 
EEAA 



51 

I 

PHEPITIGRD 
IRIRTMKMRQ 
PDIIiEVHIiPH 
GVCHEDTGEC 
TGWKGLQCNE 
KIVDLPDHIE 
IHRILPPDSG 
FGDGPIKSKK 
GPVRRFTTAS 
IKVPGNLTSV 
THSSAVISWT 
DIFAENNIGS 
QIiKRAMVQRR 
GEGNFGQVIiK 
ACEHRGYLYL 
GMDYLSQKQF 
NYSVYTTNSD 
MRQCWREKPY 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 148 DNA sequence 

Nucleic Acid Accession #: NM_000552.2 

Coding sequence: 311-8752 (underlined sequences correspond to start and stop codons) 



AGCTCACAGC 
TATCTCCCCC 
GAGCTGTAGC 
GCACCATTGT 
GCAGGGGAAG 
GCCCTCATTT 
GCCAGGGACC 
TTTCGGAAGT 
CAGTTACCTC 
GAATGGCAAG 
TGTCAATGGT 
GCTGTATCTA 
GGCCAGGATC 
GACCTGC6GG 
A6GGACCTTG 
ACAGTG6TGT 
GCAGAAGGGC 
CCACCCTCTG 
TGCT66GGGG 
GGAGGGAATG 
TGGTATGGAG 
CAATGAAATG 
GGATGAAGGC 
CCCTCCCGGC 
GATCTGCAGC 



11 
I 

TATTGTGGTG 
AGCAGTGGGG 
AGACCTGATT 
CCAGCAGCTG 
GCACCATTGT 
ATGATTCCTG 
CTTTGTGCAG 
GACTTCGTCA 
CTGGCAGGGG 
AGAGTGAGCC 
ACCGTGACAC 
GAAACTGAGG 
GATGGCAGCG 
CTGTGTGGCA 
ACCTCGGACC 
GAACGGGCAT 
CTGTGGGAGC 
GT66ACCCCG 
CT6GA6TGCG 
GTGCTGTACG 
TATAGGCAGT 
TGTCAGGAGC 
CTCTGCGTGG 
ACCTCCCTCT 
AATGAAGAAT 



21 
I 

GGAAAGGGAG 
ACTCCACAGC 
GAGCCTTTGC 
AGTTTCCCAG 
CCAGCAGCTG 
CCAGATTTGC 
AAGGAACTCG 
ACACCTTTGA 
GCTGCCAGAA 
TCTCCGTGTA 
AGGGGGACCA 
CTGGGTACTA 
GCAACTTTCA 
ACTTTAACAT 
CTTATGACTT 
CTCCTCCCAG 
AGT6CCA6CT 
AGCCTTTTGT 
CCTGCCCTGC 
GCTG6ACC6A 
GTGTGTCCCC 
GATGCGTGGA 
AGAGCACCGA 
CTCGAGACTG 
GTCCAGGGGA 



31 
I 

GGTGGTTGGT 
CCCTGGGCTA 
AGCA6CTGAG 
GGACCTTGGA 
AGTTTCCCAG 
CGGGGTGCTG 
CGGCAGGTCA 
TGGGAGCATG 
ACGCTCCTTC 
TCTTGGGGAA 
AAGAGTCTCC 
CAAGCTGTCC 
AGTCCTGCTG 
CTTTGCTGAA 
TGCCAACTCA 
CA6CTCATGC 
TCTGAAGAGC 
GGCCCTGTGT 
CCTCCTG6AG 
CCACAGCGCG 
TTGCGCCAGG 
TGGCTGCAGC 
GTQTCCCTGC 
CAACACCTGC 
GTGCCTTGTC 



41 

1 

GGAT6TCACA 

CATAACAGCA 

AGCATGGCCT 

GATA6CCGCA 

GGACCTTGGA 

CTTGCTCTGG 

TCCACGGCCC 

TACAGCTTTG 

TCGATTATTG 

TTTTTTGACA 

ATGCCCTATG 

GGTGAGGCCT 

TCAGACAGAT 

GATGACTTTA 

TGGGCTCTGA 

AACATCTCCT 

ACCTCGGTGT 

GAGAAGACTT 

TACGCCCGGA 

TGCAGCCCAG 

ACCTGCCAGA 

TGCCCTGAGG 

GTGCATTCCG 

ATTTGCCGAA 

ACTGGTCAAT 



51 
I 

GCTTGGGCTT 
AGACAGTCCG 
AGGGTGGGCG 
GCCCTCATTT 
GATAGCCGCA 
CCCTCATTTT 
GATGCAGCCT 
CGGGATACTG 
GGGACTTCCA 
TCCATTTGTT 
CCTCCAAAGG 
ATGGCTTTGT 
ACTTCAACAA 
TGACCCAAGA 
GCAGTGGAGA 
CTGGGGAAAT 
TTGCCCGCTG 
TGTGTGAGTG 
CCTGT6CCCA 
TGTGCCCTGC 
GCCTGCACAT 
GACAGCTCCT 
GAAAGCGCTA 
ACAGCCAGTG 
CCCACTTCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
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GAGCTTTGAC AACAGATACT TCACCTTCAO 
TTGCCAGGAC CACTCCTTCT CCATTGTCAT 
CGCTGTGTGC ACCCGCTCCG TCACCGTCCG 
ACTGAAGCAT GGGGCAGGAG TTGCCATGGA 
5 AGGTGACCTC CGCATCCAGC ATACAGTGAC 
CCTGCAGATG GACTGGGATG GCCGCGGGAG 
CGGGAAGACC TGCGGCCTGT GTGGGAATTA 
CCCCTCTGGG CTGGCAGAGC CCCGGGTG6A 
GGACTGCCAG GACCTGCAGA AGCAGCACAG 
10 CAGGTTCTCC GAGGAGGCGT GCGCGGTCCT 
TGCCGTCAGC CCGCTGCCCT ACCTGCGGAA 
CGGCCGCGAG TGCCTGTGCG GCGCCCTGGC 
CGTGCGCGTC GCGTGGCGCG AGCCAGGCCG 
GTACCTGCAG TGCGGGACCC CCTGCAACCT 
15 GGAATGCAAT GAGGCCTGCC TGGAGGGCTG 
GAGGGGGGAC TGCGTGCCCA AGGCCCAGTG 
GCCAGAAGAC ATCTTCTCAG ACCATCACAC 
CTGTACCATG AGTGGAGTCC CCGGAAGCTT 
GTCTCATCGC A6CAAAAGGA GCCTATCCTG 
20 CGCTGACAAC CTGCGGGCTG AAGGGCTCGA 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
TGAGAACAGA TGTGTGGCCC TGGAAAGGTG 
CCCTGGAGAA ACAGTGAAGA TTGGCTGCAA 
25 CTGCACAGAC CATGTGTGTG ATGCCACGTG 
CTTCGACGGG CTCAAATACC TGTTCCCCGG 
CTGCGGCAGT AACCCTGGGA CCTTTCGGAT 
CTCAGTGAAA TGCAAGAAAC GGGTCACCAT 
TGACGGGGAG GTGT^TGTGA AGAGGCCCAT 
30 GTCTGGCCGG TACATCATTC TGCTGCTGGG 
CCTGAGCATC TCCGTGGTCC TGAAGCAGAC 
GAATTTTGAT GGCATCCAGA ACAATGACCT 
CCCTGTGGAC TTTGGGAACT CCTGGAAAGT 
GCCTCTGGAC TCATCCCCTG CCACCTGCCA 
35 TTCCTCCTGT AGAATCCTTA CCAGTGACGT 
CGAGCCATAT CTGGATGTCT GCATTTACGA 
CGCCTGCTTC TGCGACACCA TTGCT6CCTA 
GGTGACCTGG AGGACGGCCA CATTGTGCCC 
GAACGGGTAT GAGTGTGAGT GGCGCTATAA 
40 TCAGCACCCT GAGCCACTGG CCTGCCCTGT 
CCCTCCAGGG AAAATCCTGG ATGAGCTTTT 
AGTGTGTGAG GTGGCTGGCC ' GGCGTTTTGC 
TGACCCTGAG CACTGCCAGA TTTGCCACTG 
CCAGGAGCCG GGAGGCCTGG TGGTGCCTCC 
45 GTATGTGGAG GACATCTCGG AACCGCCGTT 
CCTGGTCTTC CTGCTGGATG GCTCCTCCAG 
GGCCTTTGTG GTGGACATGA TGGAGCGGCT 
CGTGGTGGAG TACCACGACG GCTCCCACGC 
GTCAGAGCTG CGGCGCATTG CCAGCCAGGT 
50 CAGCGAGGTC TTGAAATACA CACTGTTCCA 
CTCCCGCATC GCCCTGCTCC TGATGGCCAG 
TGTCCGCTAC GTCCAGGGCC TGAA6AAGAA 
GCCCCATGCC AACCTCAAGC AGATCCGCCT 
CTTCGTGCTG AGCAGTGTGG ATGAGCTG6A 
55 CTGTGACCTT GCCCCTGAAG CCCCTCCTCC 
TGTGGGCCCG GGGCTCTTGG GGGTTTCGAC 
GGATGTGGCG TTCGTCCTGG AAGGATCGGA 
CAAGGAGTTC ATGGAGGAGG TGATTCAGCG 
CACGGTGCTG CAGTACTCCT ACATGGTGAC 
60 CAAAGGGGAC ATCCTGCAGC GGGTGCGAGA 
CACTGGGCTG GCCCTGCGGT ACCTCTCTGA 
GGAGCAGGCG CCCAACCTGG TCTACATGGT 
GAGGCTGCCT GGAGACATCC AGGTGGTGCC 
GGAGCTGGAG AGGATTGGCT GGCCCAATGC 
65 CCCCCGAGAG GCTCCTGACC TGGTGCTGCA 
CCCCACCCTC TCCCCTGCAC CTGACTGCAG 
TGGCTCCTCC AGTTTCCCAG CTTCTTATTT 
CATTTCAAAA GCCAATATAG GGCCTCGTCT 
CATCACCACC ATTGACGT6C CATGGAACGT 
70 TGTGGACGTC ATGCAGCGGG AGGGAGGCCC 
TGTGCGATAC TTGACTTCAG AAATGCATGG 
CATCCTGGTC ACGGACGTCT CTGTGGATTC 
CAACAGAGTG ACAGTGTTCC CTATTGGAAT 
GATCTTGGCA GGCCCAGCAG GCGACTCCAA 
75 CCCTACCATG GTCACCTTGG 6CAATTCCTT 
GATTTGCATG GATGAGGAT6 G6AATGA6AA 



TGGGATCTGC CAGTACCTGC TGGCCCGGGA 1560 

TGAGACTGTC CAGTGTGCTG ATGACCGCGA 1620 

GCTGCCTGGC CTGCACAACA GCCTTGTGAA 1680 

TGGCCAGGAC ATCCAGCTCC CCCTCCTGAA 1740 

GGCCTCCGTG CGCCTCAGCT ACGGGGAGGA 1800 

GCTGCTGGTG AAGCTGTCCC CCGTCTACGC 1860 

CAATGGCAAC CAGGGCGACG ACTTCCTTAC 1920 

GGACTTCGGG AACGCCTGGA AGCTGCACGG 1980 

CGATCCCTGC GCCCTCAACC CGCGCATGAC 204 0 

GACGTCCCCC ACATTCGAGG CCTGCCATCG 2100 

CTGCCGCTAC GACGTGTGCT CCTGCTCGGA 2160 

CAGCTATGCC GCGGCCTGCG CGGGGAGAGG 2220 

CTGTGAGCTG AACTGCCCGA AAGGCCAGGT 2280 

GACCTGCCGC TCTCTCTCTT ACCCGGATGA 2340 

CTTCTGCCCC CCAGGGCTCT ACATGGATQA 2400 

CCCCTGTTAC TATGACGGTG AGATCTTCCA 2460 

CATGTGCTAC TGTGAGGATG GCTTCATGCA 2520 

GCTGCCTGAC GCTGTCCTCA GCAGTCCCCT 2580 

TCGGCCCCCC ATGGTCA7W3C TGGTGTGTCC 2640 

GTGTACCAAA ACGTGCCAGA ACTATGACCT 2700 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

TCCCTGCTTC CATCAGGGCA AGGAGTATGC 2 820 

CACTTGTGTC TGTCGGGACC GGAAGTGGAA 28 80 

CTCCACGATC GGCATGGCCC ACTACCTCAC 2940 

GGAGTGCCAG TACGTTCTGG TGCAGGATTA 3 000 

CCTAGTGGGG AATAAGGGAT GCAGCCACCC 3060 

CGTGGTGGAG GGAGGAGAGA TTGAGCTGTT 3120 

GAAGGATGAG ACTCACTTTG AGGTGGTGGA 3180 

CAAAGCCCTC TCCGTGGTCT GGGACCGCCA 3240 

ATACCAGGAG AAAGTGTGTG GCCTGTGTGG 3300 

CACCAGCAGC AACCTCCAAG TGGAGGAA6A 3360 

6AGCTCQCAG TGTGCTGACA CCAGAAAAGT 3420 

TAACAACATC ATGAAGCAGA CGATGGTGGA 3480 

CTTCCAGGAC TGCAACAAGC TGGTGGACCC 3540 

CACCT6CTCC TGTGAGTCCA TTGGGGACTG 3600 

TGCCCACGTG TGTGCCCAGC ATGGCAAGGT 3660 

CCAGAGCTGC GAGGAGAGGA ATCTCCGGGA 3 720 

CAGCTGTGCA CCTGCCTGTC AAGTCACGTG 3780 

GCAGTGTGTG GAGGGCTGCC ATGCCCACTG 3 840 

GCAGACCTGC GTTGACCCTG AAGACTGTCC 3 900 

CTCAGGAAAG AAAGTCACCT TGAATCCCAG 3960 

TGATGTTGTC AACCTCACCT GTGAAGCCTG 4020 

CACAGATGCC CCGGTGAGCC CCACCACTCT 4080 

GCACGATTTC TACTGCAGCA GGCTACTGGA 414 0 

GCTGTCCGAG GCTGAGTTTG AAGTGCTGAA 42 00 

GCGCATCTCC CAGAAGTGGG TCCGCGTGGC 4260 

CTACATCGGG CTCAAGGACC GGAAGCGACC 4320 

GAAGTATGCG GGCAGCCAGG TGGCCTCCAC 4380 

AATCTTCAGC AAGATCGACC GCCCTGAAGC 4440 

CCAGGAGCCG CAACGGATGT CCCGGAACTT 4500 

GAAG6TCATT GTGATCCCGG TGGGCATTGG 4560 

CATCGAGAAG CAGGCCCCTG AGAACAAGGC 4620 

GCAGCAAAGG GACOAGATCG TTAGCTACCT 4680 

TACTCTGCCC CCCCACATGG CACAAGTCAC 474 0 

CCTGGGGCCC AAGAGGAACT CCATGGTTCT 4800 

CAAAATTGGT GAAGCCGACT TCAACAGGAG 4 860 

GATGGATGTG GGCCAGGACA GCATCCACGT 4 920 

CGTGGAGTAC CCCTTCAGCG AGGCACAGTC 4 980 

GATCCGCTAC CAGGGCGGCA ACAGGACCAA 5040 

CCACAGCTTC TTGGTCAGCC AGGGTGACCG 5100 

CACCGGAAAT CCTGCCTCTG ATGAGATCAA 5160 

CATTGGAGTG GGCCCTAATG CCAACGTGCA 5220 

CCCTATCCTC ATCCAGGACT TTGAGACGCT 5280 

GAGGTGCTGC TCCGGAGAGG GGCTGCAGAT 5340 

CCAGCCCCTG GACGTGATCC TTCTCCTGGA 5400 

TGATGAAAT6 AAGA6TTTC6 CCAAGGCTTT 5460 

CACTCAGGTG TCAGT6CTGC AGTATGGAAG 5520 

GGTCCCGGAG AAAGCCCATT TGCTGAGCCT 5580 

CAGCCAAATC GGGGATGCCT TGGGCTTTGC 5640 

TGCCAGGCCG GGAGCCTCAA AGGCGGTGGT 5700 

AGTGGATGCA GCAGCTGATG CCGCCAGGTC 5760 

TGGAGATCGC TACGATGCAG CCCAGCTACG 5820 

CGTGGTGAAG CTCCAGCGAA TCGAAGACCT 5880 

CCTCCACAAA CTGTGCTCTG GATTTGTTAG 5940 

GAGGCCCGGG GACGTCTGGA CCTTGCCAGA 6000 
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CCAGTGCCAC ACCGTGACTT GCCAGCCAGA TGGCCAGACC TTGCTGAAGA GTCATCGGGT 6060 
CAACTGTGAC CGGGGGCTGA GGCCTTCGTG CCCTAACAGC CAGTCCCCTG TTAAAGTGGA 6120 

AGAGACCTGT GGCTGCCGCT GGACCTGCCC CTGCX3TGTGC ACAGGCAGCT CCACTCGGCA 6180 

CATCGTGACC TTTGATGGGC AGAATTTCAA GCTGACTGGC AGCTGTTCTT ATGTCCTATT 624 0 

TCAAAACAAG GAGCAGGACC TGGAGGTGAT TCTCCATAAT GGTGCCTGCA GCCCTGGAGC 6300 

AAGGCAGGGC TGCATGAAAT CCATCGAGGT GAAGCACAGT GCCCTCTCCG TCGAGCTGCA 6360 

CAGTGACATG GAGGTGACGG TGAATGGGAG ACTGGTCTCT GTTCCTTACG TGGGTGGGAA 642 0 

CATGGAAGTC AACGTTTATG GTGCCATCAT GCAT6AGGTC AGATTCAATC ACCTTGGTCA 64 80 

CATCTTCACA TTCACTCCAC AAAACAATGA GTTCCAACTG CAGCTCAGCC CCAAGACTTT 6540 

TGCTTCAAAG ACGTATGGTC TGTGTGGGAT CTGTGATGAG AACGGAGCCA ATGACTTCAT 6600 

GCTGAGGGAT GGCACAGTCA CCACAGACTG GAAAACACTT GTTCAGGAAT GGACTGTGCA 6660 

GCGGCCAGGG CAGACGTGCC AGCCCATCCT GGAGGAGCAG TGTCTTGTCC CCGACAGCTC 6720 

CCACTGCCAG GTCCTCCTCT TACCACTGTT TGCTGAATGC CACAAGGTCC IGGCTCCAGC 6780 

CACATTCTAT GCCATCTGCC AGCAGGACAG TTGCCACCAG GAGCAAGTGT GTQAGGTGAT 6840 

CGCCTCTTAT GCCCACCTCT GTCGGACCAA CGGGGTCTGC GTTGACTGGA GGACACCTGA 6900 

TTTCTGTGCT ATGTCATGCC CACCATCTCT GGTCTACAAC CACTGTGAGC ATGGCTGTCC 6960 

CCGGCACTGT GATGGCAACG TGAGCTCCTG TGGGGACCAT CCCTCCGAAG GCTGTTTCTG 7020 

CCCTCCAGAT AAAGTCATGT TGGAAGGCAG CTGTGTCCCT GAAGAGGCCT GCACTCAGTG 7080 

CATTGGTGAG GATGGAGTCC AGCACCAGTT CCTGGAAGCC TGGGTCCCGG ACCACCAGCC 7140 

CTGTCAGATC TGCACATGCC TCAGCGGGCG GAAGGTCAAC TGCACAACGC AGCCCTGCCC 7200 

CACGGCCAAA GCTCCCACGT GTGGCCTGTG TGAAGTAGCC CGCCTCCGCC AGAATGCAGA 7260 

CCAGTGCTGC CCCGAGTATG AGTGTGTGTG TGACCCAGTG AGCTGTGACC TGCCCCCAGT 7320 

GCCTCACTGT GAACGTGGCC TCCAGCCCAC ACTGACCAAC CCTGGCGAGT GCAGACCCAA 73 80 

CTTCACCTGC GCCTGCAGGA AGGAGGAGTG CA7UUVGAGTG TCCCCACCCT CCTGCCCCCC 7440 

GCACCX5TTTG CCCACCCTTC GGAAGACCCA GTGCTGTGAT GAGTATGAGT GTGCCTGCAA 7500 

CTGTGTCAAC TCCACAGTGA GCTGTCCCCT TGGGTACTTG GCCTCAACCG CCACCAATGA 7560 

CTGTGGCTGT ACCACAACCA CCTGCCTTCC CGACAAGGTG TGTGTCCACC GAAGCACCAT 7620 

CTACCCTGTG GGCCAGTTCT GGGAGGAGGG CTGCGATGTG TGCACCTGCA CCGACATGGA 7680 

GGATGCCGTG ATGGGCCTCC GCGTGGCCCA GTGCTCCCAG AAGCCCTGTG AGGACAGCTG 7740 

TCGGTCGGGC TTCACTTACG TTCTGCATGA AGGCGAGT6C TGTGGAAGGT GCCTGCCATC 7800 

TGCCTGTGAG GTGGTGACTG GCTCACCGCX3 GGGGGACTCC CAGTCTTCCT GGAAGAGTGT 7860 

CGGCTCCCAG TGGGCCTCCC CGGAGAACCC CTGCCTCATC AATGAGTGTG TCCGAGTGAA 7920 

GGAGGAGGTC TTTATACAAC AAAGGAACGT CTCCTGCCCC CAGCTGGAGG TCCCTGTCTG 7980 

CCCCTCGGGC TTTCAGCTGA GCTGTAAGAC CTCAGCGTGC TGCCCAAGCT GTCX3CTGTGA 8040 

GCGCATGGAG GCCTGCATGC TCAATGGCAC TGTCATTGGG CCCGGGAAGA CTGTGATGAT 8100 

CGATGTGTGC ACGACCTGCC GCTGCATGGT GCAGGTGGGG GTCATCTCTG GATTCAAGCT 8160 

GGAGTGCAGG AAGACCACCT GCAACCCCTG CCCCCTGGGT TACAAGGAAG AAAATAACAC 8220 

AGGTGAATGT TGTGGGAGAT GTTTGCCTAC GGCTTGCACC ATTCAGCTAA GAGGAGGACA 8280 

GATCATGACA CTGAAGCGTG ATGAGACGCT CCAGGATGGC TGTGATACTC ACTTCTGCAA 834 0 

GGTCAATGAG AGAGGAGAGT ACTTCTGGGA GAAGAGGGTC ACAGGCTGCC CACCCTTTGA 84 00 

TGAACACAAG TGTCTGGCTG AGGGAGGTAA AATTATGAAA ATTCCAGGCA CCTGCTGTGA 8460 

CACATGTGAG GAGCCTGAGT GCAACGACAT CACTGCCAGG CTGCAGTATG TCAAGGTGGG 8520 

AAGCTGTAAG TCTGAAGTAG AGGTGGATAT CCACTACTGC CAGGGCAAAT GTGCCAGCAA 8580 

A6CCATGTAC TCCATTGACA TCAACGATGT GCAGGACCAG TGCTCCTGCT GCTCTCCGAC 864 0 

ACGGACGGAG CCCATGCAGG TGGCCCTGCA CTGCACCAAT GGCTCTGTTG TGTACCATGA 8700 

GGTTCTCAAT GCCATGGAGT GCAAATGCTC CCCCAGGAAG TGCAGCAAG T GAG GCTGCTG 8760 

CAGCTGCATG GGTGCCTGCT GCTGCCTGCC TTGGCCTGAT GGCCAGGCCA GAGTGCTGCC 8820 

AGTCCTCTGC ATGTTCTGCT CTTGTGCCCT TCTGAGCCCA CAATAAAGGC TGAGCTCTTA 8880 
TCTTGCTGCA TGTTCTGCTC TTGTGCCCTT CTGAGCCCAC AAT 

Seq ID No: 149 Protein secmencei 
Protein Accession #: NP 000543.1 



1 11 21 31 41 51 

I I I I I I 

MIPARFAGVL LALAIilLPGT LCAEGTRGRS STARCSLPGS DFVNTFDGSM YSFAGYCSYI. 60 

LAGGCQKRSF SIIGDFQNGK RVSLSVYLGE FFDIHLFVNG TVTQGDQRVS MPYASKGLYI, 120 

ETEAGYYKLS GBAYGFVARI DGSGNPQVIiIj SDRYFNKTCG LCGNFNIPAE DDFMTQEGTL 180 

TSDPYDFANS WALSSGEQWC ERASPPSSSC NISSGEMQKG LWEQCQLLKS TSVFARCHPL 24 0 

VDPBPFVAIjC EKTLCECAGG LECACPAIiIiE YARTCAQEGM VLYGWTDHSA CSPVCPAGME 300 

YRQCVSPCAR TCQSLHINEM CQERCVDGCS CPEGQLLDEG LCVESTECPC VHSGKRYPPG 360 

TSLSRDCNTC ICRNSQWICS NEECPGECIiV TGQSHFKSFD NRYFTFSGIC QYLLARDCQD 420 

HSFSIVIETV QCADDRDAVC TRSVTVRLPG LHNSLVKIiKH GAGVAMDGQD IQLPLLKGDL 4 80 

RIQHTVTASV RI*SYGEDIiQM DWDGRGRIJiV KLSPVYAGKT CGLCGNYNGN QGDDFLTPSG 540 

LABPRVEDFG NAWKLHGDCQ D1,QKQHSDPC ALNPRMTRFS EEACAVLTSP TFEACHRAVS 600 

PLPYLRNCRY DVCSCSDGRB CLCGALASYA AACAGRGVRV AWREPGRCEI* NCPKGQVYIiQ 660 

CGTPCNIjTCR SLSYPDEECN BACLEGCFCP PGLYMDBRGD CVPKAQCPCY YD6E1FQPBD 720 

IPSDHHTMCY CEDGPMHCTM SGVPGSliLPD AVMSPLSHR SKRSLSCRPP MVKLVCPADN 780 

LRAEGliECTK TCQNYDIiECM SMGCVSGCIiC PPGMVRHENR CVAIiERCPCF HQ6KEYAPGB 840 

TVKIGCNTCV CRDRKWNCTD HVCDATCSTI GMAHYLTFDG LKYLFPGECQ YVLVQDYCGS 900 

NPGTFRILVG NKGCSHPSVK CKKRVTILVE GGEIELFDGE VNVKRPMKDB THFBWESGR 960 

YIILLLGKAIi SWWDRHLSI SWIiKQTYQE KVCGLCGNFD GIQNNDLTSS HZiQVEEDPVD 1020 

FGNSWKVSSQ CADTRKVPLD SSPATCHNNI MKQTMVDSSC RILTSDVFQD CNKIiVDPBPY 1080 

LDVCIYDTCS CESIGDCACF CDTIAAYAHV CAQHGKWTW RTATLCPQSC EBRNLRENGY 1140 

ECEWRYNSCA PACQVTCQHP BPIACPVQCV B6CHAHCPPG KIU)ELbQTC VDPEDCPVCB 1200 
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VAGRRFASGK KVTIiNPSDPE HCQICHCDW NLTCEACQEP GGLWPPTDA PVSPTTLYVE 1260 

DISEPPLHDP YCSRLLDLVF IiLDGSSRLSE AEFEVIiKAFV VDMMERLRIS QKWVRVAWE 132 0 

YHDGSHAYIG LKDRICRPSEL RRIASQVKYA GSQVASTSEV liKYTLFQIFS KIDRPEASRI 1380 

ALLLMASQEP QRMSRNFVRY VQGIjKKKKVI VIPVGIGPHA NLKQIRLIEK QAPENKAFVL 1440 

SSVDELEQQR DEIVSYLCDL APEAPPPTIiP PHMAQVTVGP GLLGVSTLGP KRNSMVLDVA 1500 

FVLEGSDKIG EADFNRSKEF MBEVIQRMDV GQDSIHVTVIi QYSYMVTVEY PFSEAQSKGD 1560 

ILQRVREIRY QGGNRTNTGIj ALRYLSDHSF LVSQGDREQA PNLVYMVTGN PASDEIKRLP 1620 

GDIQWPIGV GPNANVQELE RIGWPNAPIL IQDPETIiPRE APDLVLQRCC SGEGLQIPTL 1680 

SPAPDCSQPIi DVILLIiDGSS SFPASYFDBM KSPAKAFISK ANIGPRLTQV SVLQYGSITT 1740 

IDVPWNWPE KAHLLSLVDV MQREGGPSQI GDALGFAVRY liTSEMHGARP GASKAWILV 1800 

TDVSVDSVDA AADAARSNRV TVFPIGIGDR YDAAQIiRILA GPAGDSNWK LQRIEDLPTM 1860 

VTIiGNSFIiHK LCSGFVRICM DEDGNEKRPG DVWTLPDQCH TVTCQPDGQT LIiKSHRVNCD 1920 

RGI4RPSCPNS QSPVKVEETC GCRWTCPCVC TGSSTRHIVT PDGQNFKLTG SCSYVLFQNK 1980 

EQDIiBVIIiHN GACSP6ARQG CMKSIEVKHS ALSVELHSDM EVTVNGRLVS VPYVGGNMEV 2040 

NVYGAIMHEV RFNHLGHIFT PTPQNNEPQI* QIiSPKTFASK TY6LCGZCDE NGANDFMIiRD 2100 

GTVTTDWKTIi VQEWTVQRPG QTCQPIIjEEQ CLVPDSSHCQ VIiLIiPIiFAEC HKVI*APATPY 2160 

AICQQDSCHQ EQVCEVIASY AHIiCRTNGVC VDWRTPDFCA MSCPPSIiVYN HCEHGCPRHC 2220 

DGNVSSCGDH PSEGCFCPPD KVMUSGSCVP BEACTQCIGE D6VQHQPLEA WVPDHQPCQI 2280 

CTCLSGRKVN CTTQPCPTAK APTCGLCEVA KLRQNADQCC PEYECVCDPV SCDLPPVPHC 2340 

ERGLQPTLTN PGECRPNFTC ACRKEECKRV SPPSCPPHRL PTLRKTQCCD EYECACNCVN 2400 

STVSCPLGYL ASTATNDCGC TTTTCLPDKV CVHRSTIYPV GQFWEEGCDV CTCTDMEDAV 2460 

MGLRVAQCSQ KPCEDSCRSG FTYVLHEGEC CGRCLPSACE WTGSPRGDS QSSWKSVGSQ 252 0 

WASPENPCLI NECVRVKEEV FIQQRNVSCP QLEVPVCPSG FQIiSCKTSAC CPSCRCERME 2 580 

ACMIiNGTVIG PGKTVMIDVC TTCRCMVQVG VISGFKLiECR KTTCNPCPLG YKEENNTGEC 2640 

CGRCIiPTACT IQLRGGQIMT LKRDETLQDG CDTHFCKVNE RGEYFWEKRV TGCPPFDEHK 2700 

CLAEGGKIMK IPGTCCDTCE EPECNDITAR LQYVKVGSCK SEVEVDIHYC QGKCASKAMY 2760 
SIDINDVQDQ CSCCSPTRTE PMQVAIiHCTN GSWYHEVLN AMECKCSPRK CSK 



Seq ID NO; 150 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1-1362 (vinderlined sec[uences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGG CTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120 
TTCX5TGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180 
AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240 
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 
GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 4 80 
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 
GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 
CAGCCCGAGA CCTCCAATAT GTCCATCT6T ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660 
CAGTCCAGCA TCTTCGGCGC CTTC6TGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAA6CCAGA AGGGCTCGCT GGCGGGGGGC 780 
ACGCGGCCTC C6CAGCTGAG GAAGTCCGAG AGCGAAGA6A GCAG6ACCGC CAGGA6GCAG 840 
ACCATCATCT TCCTGAGGCT GATTGTTGT6 ACATT6GCCG TATGCTG6AT GCCCAACCAG 900 
ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCAC6ACT G6ACGAGGTC CTACTTCCX5G 960 

GC6TACAT6A TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080 

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GA6AAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT_GA 

Seq ID No: 151 Protein sequence ; 
Protein Accession #: NP_001499.1 



1 11 21 31 41 51 

I I I I I i 

MA5PSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYIiII FVMGIiIjGNSV TIRVTQVLQK 60 

KGYIiQKEVTD HMVSIACSDX LVFIiIGMPME FYSIIWNPI.T TSSYTI*SCBCL HTPLFEACSY 120 

ATIiliHVIiTIiS FBRYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPI, 180 

VNVPSHRGLT CNRSSTRHKE QPETSNHSIC TNIiSSRWTVF QSSIFGAFW YLWIiLSVAF 240 

MCWNMMQVIjM KSQKGSIiAGG TRPPQLRKSE SEGSRTARRQ TIIFIiRZilW TIAVCWMPNQ 300 

IRRIMAAAKP KHDWTRSYFR AYMILIiPFSB TPFYLSSVIH PtLYTVSSQQ FRRVPVQV1.C 360 

CRLSLQHAMH EKRWVHAHS TTDSARFVQR PIiLFASRRQS SARRTEKIFI* STFQSEAEPQ 420 

SKSQSLSIiBS IiEPNSGAKPA NSAAENGFQE HEV 

Seq ID NO: 152 DNA sequence 



253 



wo 02/079492 



PCT/LIS02/04915 



Nucleic Acid Accession #: none found 

Coding sequence: 3-65 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

TTATTATTTT GTGTAAACTA TATTCTGCTT ATAGAGAGTC TCTGAGACTA AAATTGACAA 60 
CTTGAAAAGT ATTCCAAGGA ATATTATGAA AATAGGGCAA CATGGACTGT TTAAGATCTC 120 
CATGTAATTG AAATTCATGC AAGGAAACAA CTCATAGAAA AGATAAATAT GGATGCCCTT 180 
CACATGTTAT CAACCTC6TA ACTTTTGGTG CTTGCTGAAT CAGTCCATGA AAAGCTACAG 24 0 
CCCGCTCTTT GGGAATGCTA CATACCCATT TCTGGTATTT AAAAAATATC TAGGAGGAGC 3 00 
TAAATGACAA AACACAGCAG TGTTTTGA6G GAGAAAGGAC CATCATTTAT AATCCTCTGT 360 
ACATACTACC AGAGCT6CTT GGAAAATTAA AGGCCACTTG TGGCTTTTTC CTACCAACTG 4 20 
ATACGTTTAA ATTTGCCCTA GGATTSAGCT AACAGCAAAA AAAAAAAAAA AAAAAAAARA 480 
GAGAGAAAGA AAGGAGKAAA CAGTGGTAAT AAJVUU^TCC ATCT6TCTTC TTGCTATGTT 540 
AATATTAATA AATCATAATA TGACAAGACX: CTCACTGAAT AAGAGTATTT TCAGTCATCA 600 
GAAG CCAGC T GTTGGTAGGC ATTAATGAGT TTAAAATTGT TCTCAATTGA AAAAACATCA 660 
CACTATTTTG CCAAAACCAA AGTAATTATA ATACTGTGTC CTCCTGTAAT TTTTTGAGAA 720 
GTGGTTATAA AGGGCATATT TACATAAATT CTACTTTATT CCTCAACTTC TTTGATGAAT 780 
GTAACCCAAT TTTACTTCTT TAAAAAGTCT CAATTCAAGC TGGATTAGCC AGCTCAGCAT 840 
AATCAACTAG ACAGTGGTTT GTTAAATTTA GCAGCATACT TCGTTCCCAT TCTAATTAAA 900 
GTCATGAGTT CTTGAATCCC AGAGAAATAA TGCTTAGGAA CTTCTCTCAA TCTGCTTGGC 960 

TTGGCCTAGA GAAGTGGCCA TTTTATCAAC AGGRAAAAAA AAAATTTTCT CTACTACAAC 1020 

CCCGTTGCCT TCTGAAAAAC AGCAAGTTAT TTCTTTATAT AATTATCATT TTATTATTTT 1080 

ATGGAAAATT AATTTATTAA TTAATAGCCT ATTATGTGTT CTCACTTGCT TCTCTAAGTA 114 0 

ATATTTTGAG ATAAAATGTT GAATAAAACC ATGGATTATA GAGAAAAGTC AAAATATATG 1200 
TGTAATATTT AATTATTTTA TAAGTTTTAT AATAAAGTAT TCCATTTCTT TATCTT 

Seq ID No: 153 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

IIIiCKLYSAY RESLRLKLTT 

Seq ID NO: 154 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-36 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I 

CTGGATQATA TGGAAGAAAT GGATGGGTTA AGGTAAAAGG CTGATCACAG ATGGGTTCCT 60 

CT CAAG GTTA AAATAGTTTA AGTGCCAGAA GAAAAGGTGG GCACCAGCGA ATTAAGAACC 120 

ATCTTTGAAT GGTCCCCTTG GTTAAATACT TAACTTTTGT CATCAGTGTC TGCATTTATG 180 

AAATGAAGAG GAATTCACTA ATATGCTACG TGATCTTTTG TTTGTCATGA AAAGAGTTAC 240 

TGTTGTGTA6 TTCTCTGTTC CAGGGCTGCC TTTGCTCCAC AAAGCACTGA GAAGCAGTGG 300 

CCCTGTACAA CCATACTGCC TCTCAACACT GTGTAATAGG CTAACACCGC CCAGCGAACC 360 
TTCCTGGGAG ATATAAAATA CATAGGTTTA GGCTGGCAAA AAAAAAAAAA AAA 

Seq ID No: 155 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

LDDMEEMDGL R 



Seq ID NO: 156 DNA sequence 

Nucleic Acid Accession #: NM_032961.1 

Coding sequence: 827-3949 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

' ^ ' I 1 I I 

CAGGCTCAGA GGCTGAAGCA GGAGGAAGQA AGGACTGGAA GGAAAAAGAO ACAGGTTAGA 60 

GGGAAAGAGG CTTG<3GAA6A AAACAGCS^GA AAAGAAACTG CTCATTACa^C TTACAGAGAG 120 

GCAAGTAACG GTGGAGATGA 6GACAGAGGG AACCAAGACT CTGAAAGACA AAAAATACAA 180 

ATAGAGCGAA AGAGGAAAAA AATGTCAAGA AGAACATCCA TCCGGA<3AAA TCSAAGAGAAT 240 

GAAAGTTTTA AACTGCAGAG CCGTTCTGTG CTTTTCCGGC ACAAAATTAT ATCGCTGATT 300 

TTAAGCCCTT TTGCATTTGC CAGCCGTTGA CATTAAGAGG CATGTTTAAC GGTGCCAACA 360 

GCATCTCCTT TTCCTTCTCC TCTTCCTCTT CTTCTTCTTC CTCCTCCTCC TCCTCITTTT 420 
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CCTCCTCCTC 
GAAATTTCCT 
TGTATTATTG 
CCTGTCACCC 
AAAATGAAGC 
CGCACTTTTA 
TGGGTGGCTG 
ATTGTTTGCC 
GGAGGAGCAG 
TACAAAACTT 
CCTCAACCTG 
CAAACAGA6C 
GTTCCAGGTG 
AGACCTGACG 
CGCATTCGAC 
CTACTTCTCC 
GGAGAAGCCA 
CGGAGGAGGT 
CCCCCAGCAG 
CAATGTGCCC 
AGGCACTCTC 
CGTGTACTCC 
GCGCACTGGC 
AGTGTACGTG 
AGTGCGAGTA 
AGCGGTGAGT 
CGACTCAGAG 
CAAGTCTTCC 
GGGGGACTCC 
CAGTAAGTCG 
GCCGGTCTAC 
GAGCGCCACC 
CCAGATCCAG 
GTACGCCCTG 
CCGGGACGCT 
GGATCATU^T 
AGCGCGTGAG 
CGTGGACGCG 
AATGAACCTC 
GGCCAAGCGC 
GCCGCCCCTT 
CCAGGGCGGG 
TGGCGGCGGG 
GTCCTTCATC 
GCTCAACATC 
CGGAGGTTCG 
AGACATCATG 
GGAGTCCGGG 
GACCCCTGAG 
TACGGACACT 
TGATATCATC 
CAGCTATCTA 
AGTAAGCTCT 
CACCAACCGT 
AGCTCTGGGC 
GGCTGCTGAT 
GGTGTTTGAA 
AGAGAAGGCC 
GCGAGCGCCT 
GGACTTACCT 
ACTTCATTAT 
GTCATCATGG 
TGTGCAGAAC 
TGTTTACAGC 
ATTCACCACG 
CACTGTTTTA 
TTCCAACTTT 
TTGTATTCTG 
AAGT6TTTAC 
TATAATTTTC 
CATCATACAA 
TCATTAATAG 
AGAAAATGAT 
AAAGATGTAG 
ACAATCTGCA 
TTATAAAGAA 



GTTCTCCTCC 
CTTTGGGATT 
TTATTTTATT 
TTCCTGTGCT 
AAAAGGAGTA 
TTTGTATTTT 
ACTGGCTGCG 
TTGCTCTGGA 
GAACATGGCA 
TCGGCTCGCG 
GAGACAGGGG 
CCCTCCTGTG 
6AGATC6A6G 
GTGGAAATCT 
CCAGACGTGG 
CTGGACGTGC 
CTGGACCGAG 
GGGGGAGGAG 
CAGCGCACCG 
GCTTTCGACC 
GTGATCCAGC 
TTCAGCAGCC 
AGACTGGAGG 
CAAGCCAAGG 
CTGGATGCTA 
GAGGGCGCGG 
GAGAATGGGC 
TTTAAGAATT 
TACACCCTGA 
ATCCAGGTAC 
GACGTGTATG 
6ACC6GGAT6 
GGCATGAGCG 
CGCTCCTTCG 
GGCAGCCCCC 
GACAACGCCC 
GTGCTGCCCC 
GACGACGGCG 
TTTCGCATGG 
GACCCCCAGC 
TCCTCCACCG 
GGCGGGAGCG 
GAAACCTCGC 
TTCCTGCTGG 
TATACTTGTC 
ACCTGCTGTG 
CTGGTGCAGA 
GGCTTTGGCT 
TCCGCCAAGA 
GAGCACAACC 
TCCAACGGAA 
GTT6ACAGAC 
AAGGACAGTG 
GCCCAGTCAG 
CACTCAGATC 
TATCGCAGCA 
ACTCCAGAAG 
CTTCACAGCA 
TACAAACCAC 
GAAGCAGCAT 
CTTGGCCATC 
CCAATTATAG 
TGTAGAAACT 
ACTATCTATC 
AGAAGCCA6T 
AACTTGACTG 
ACAAGAGAAA 
AA6ACCCACA 
TGTACTATTT 
CTAAAAT6TG 
TAAAATAAAA 



GCTCTAAGCT 
CTATTGATGT 
TAAGTCTGAT 
TCGATAAATT 



CATCAGCAAG 
TGCCAGCGCC 
AATTAGTCAG 
AAGATTTAAA 
AGATTTTTAA 
TTCAGATTTT 
GGAAGCTACT 
TGGTGGAAGG 
CTTTCGTGGG 
GGTTTCAGAC 
TGCTGTACGT 
TCCT6CACCT 
TGCTGGACAT 
CTGAGA6CGC 
GCACCAACTC 
AGACCCAGGG 
AGCAGCAAGC 
TAGGAGAAGG 
GCACGGCCCT 
AACCCGTCTA 
TCAACGCCAC 
ACATTTCGCC 
TAAGCGGCGA 
ACCTGGGCCC 
ATGACAACGC 
CGCCCGGCAC 
AGGTGCAGTG 
ACTACACCAT 
CTGTAGTGGC 
AAGTGTCGGA 
TGACTGAAAA 
AGGGCGCCAA 
TCTTCACCTA 
ACTATGAGCA 
AGGCGCTGGC 
CTGCCATCGT 
GCTCGGCGGA 
AGAACGCCCG 
ACTGGCGCAC 
GGCCTTATGA 
CCACCCTGGT 
GAGGCGGAGG 
TAGACCTCAC 
CCATGATCGT 
TGGCCAGCGA 
GCCGCCAAGC 
GCTCCAATGT 
CCCACCACCA 
CCGACCTGAT 
CCTGC6GGGC 
GCATTTTGTC 
CTCGCCGAGT 
GTCATGGAGA 
CTGGTATGGA 
GGTGCTGGAT 
ATCTGCATGT 
CCCAGCCTGG 
CTCTGGAGAG 
CATATTTGAC 
GATTTGCACA 
CAGTTAGTCA 
GACCTAATTG 
TTAGAGGCAA 
TTTCTCTCTC 
CATAAAGATA 
TTTTATATTA 
TTGTGATTAT 
AAATATCAAA 
CAAAGCTTCT 
GTACAACTCA 
GGTAATTCAG 
ATTTCCATAT 
ACAAAATTTT 
TATCAGACAG 
TCTATTTCTA 
CACCTGTATT 



AAGACAAACC 
AAGACTGTCG 
TGGAAAGATT 
AAAAAATGAG 
AGACAGAAAG 
TTTTTGTTTC 
TCCTTTCCTT 
AGTCTTTTCC 
GAATATCGCT 
GGTGCCCAAC 
G/^CXSAGAAA 
GGAGGTCTTT 
TAATGACAAC 
CACGCCAGGC 
CTTGCGCGAC 
GGAT6GCAAC 
GGTGCACCGC 
AGGGGGAGGT 
ACTCACCATC 
CACTGTGTCC 
CGACCCGGAC 
CCGGGCGCGG 
GTTGGACTAT 
CAACGCCGTG 
GCCAGAGATC 
TGTGGTGGCC 
CGAGCTACTG 
CGTTACCGAA 
TCGGGACCGG 
TGTGAACGAC 
CAAC6TGCCT 
CGCCCAGCTT 
CGTTTCTATC 
GCTGAAG6AC 
TGGTAACGCC 
GGCGCCTCTA 
GCCGGGTTAC 
GCTCACTTAC 
CGGGGAGCTG 
GCTGGTGATC 
GGTTCAGCTG 
GTCAGGAGAG 
CCTCATCCTC 
GCTGGCCGTG 
TTGCTGCCTC 
CCGGGCGCGG 
ACCCAGTAAC 
CAACCAGAAT 
GTTTCTTAAG 
CATCGTCACC 
CAACGAGACT 
TAACAGTTCT 
CAGTGAACAG 
TCTCTTCTCC 
GCCTTCTTTT 
TCCTGGCATG 
GGCAGAGCGG 
GAAGGAGCTG 
ACGGAAAAGG 
AAGTCGACCA 
TGTGTAACTG 
CTCTCAGCAG 
CAGATTTTGC 
CAAATGTCAC 
AAGGAAATTT 
TTTTTGTGTG 
GTTCTTTTCA 
GACATTCTGT 
AAAT AAATAT 
GTTGGTTTTT 
GGTCCCAAAG 
CTTACTCAAC 
GTCAAAAACT 
AGCACTGACT 
T6ACTTTGAA 
TGTTGTTAGA 



GAGGACAGTC 
GAATAAAGGA 
ACAGATGAGG 
GCTGGATTGC 
CCACAGGAGC 
GTGGTGGTGG 
TTGGAGATGA 
CAGCTTCACT 
GAAGATCTGG 
TCAAGGACCC 
ATAGACC6C6 
CTGGAGAACC 
CCCCCCTCTT 
ACTCGCTTCC 
TACGAGATCA 
CGATTCGCTG 
TACGTGCTGA 
GGCGGGGGAG 
CGAGTGCTGG 
CTACCAGAGA 
GAGGGCCAGA 
GAGCTTTTCG 
GAAGAGAGCC 
CCTGCGCACT 
AGCTTCAGCA 
CTTTTCAGCG 
GGAGACGTGC 
GCCCCCCTGG 
GGCGAGCCTG 
AACGC6CCGC 
GGCGCCTACA 
GCCTACTCTA 
AACTCTGAGA 
TTCAGTTTTC 
ACTGTCAACA 
CCAGGGCGCA 
CTGCTCACCC 
AGCATCGTGC 
CGCACAGCAC 
GAGGTGCGCG 
GTGGATGGCG 
CACCAGCGCC 
ATCATCGCGT 
CGTTGCCAAA 
TGCTGCTGCT 
AAGAA6AAAC 
CCGGCCCAGG 
TACTGCTATC 
CCCTGCAGCC 
GGTTACACC6 
AAACACCAGC 
GCATTCCAGG 
GGAGATAGTG 
AATTGCACTG 
GTCCCTTCTG 
GACTCTGTTC 
TCCTTTTCCA 
GATGGACTGC 
ATATGCTAGT 
ACAAAAGCAT 
AGTATTAGAT 
GCCTGAGAAA 
CTCCCCGATC 
TGAGCCCTTT 
GTGCATTATA 
ATCAAGTGTT 
CCTGTGGGTT 
AGTTTATACA 
AAAATATATA 
AAATGGATGC 
ACAAACTTAC 
CGTGTTTTTC 
CATATTGAAT 
ATGTACTATC 
TTTAGAATCA 
AAAAAACTGG 



TTGAAATATC 
CGCTGACTAT 
AAAGGGGACG 
GGGAAGCTCT 
CCCCACGTAG 
GGGAGGTGAT 
TTGTGCTATT 
ACACGGTACA 
GTCTGGACAT 
CTTACTTAGA 
AACAAATCTG 
CCCTGGAGCT 
TCCCGGAGCC 
CCTTGGAGAG 
CCCCCAACAG 
AGCTGGTGCT 
CCGCGGTG6A 
CAGGCCTGCC 
ACTCCAATGA 
ACTCTCCCCC 
ACGGTGAGGT 
GACTCTCGCC 
CAGTGTACCA 
GCJ^GGTGCT 
CCGTGAAGGA 
TGACTGACCG 
CTTTCCGCCT 
ACCGAGAGGC 
CGCTCTCCAC 
GTTTCAGCCA 
TCTACGCGGT 
TCCTCGAGTG 
ACGGCTACTT 
AGGTGGAAGC 
TCCTCATAGT 
ACGGGACTCC 
GCGTGGCCGC 
GTGGCAACGA 
GCCGAGTCCC 
ACCATGGGCA 
CCGTGGAGCC 
CCAGTCGCTC 
TGGGCTCGGT 
AAGAGAAGAA 
GCTGCGGTGG 
TCAGCAAGTC 
TGCCGATAGA 
AGGTATGCCT 
CTTCGCGGAG 
ACCAGCAGCC 
GAGCAGAGCT 
AAGCCGACAT 
ATCATGATGC 
AGGAATGTAA 
ATGGACGCCA 
CAGACACTGA 
CCTTTGGCAA 
TGACTAATAC 
CAATTCTACA 
CAACTTTTCA 
TTCGGATGGA 
TGAGTTGAAA 
AGTGT6TGCC 
AGATGTTTAT 
AATGCAATAT 
CC6CAAGCTA 
ATAAAAAATG 
CCGTGTTGCA 
TATTATATTA 
ATACAGTCCA 
TAAGAT^AAAA 
CTTGTTTAAA 
TTTCAATGCC 
AAACTATCTA 
CTTAAAGCTT 
GTGTCTGTAC 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 



255 
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ATTTTGTGGT 
ACTCACACAG 
TATAATTCTA 
AAAATATGAA 
TTCAGTATTC 
AATTCTTCAT 
TTAATTATTA 



GTAAAATATG 
AATTTTATTT 
TATGAATATA 
AGCTCTAAAT 
CATGTAAAAA 
GAGTCAGCCT 
AATTTAGTAA 



TAATTGAAGA 
TACATAGTTT 
TAGAGATATA 
TTAAAATAAA 
TTTTATAGCT 
TCAAAAGTTA 
GACGCAAAAA 



TTACTATTTT 
TGTGACTTAA 
GAAACATCTG 
TTTAGAGATA 
TAAATGTAGT 
AGCTTGCCTT 
AAAAAAAAAA 



AAGAAGTCAT 
TTACACATGA 
AACTGGTAAA 
GAATCATGGT 
CAGTGTTTGA 
TTACTTTTAT 
AAAA 



CAGTCATATC 504 0 

ATATAAAATC 5100 

G AAT AACTAT 5160 

ACATT ATTGT 5220 

TTAATGAAAA 5280 

6TCAACAATA 5340 



10 



Seq ID No: 157 Protein sequence : 
Protein Accession #: Np 116586.1 



15 



20 



25 



30 



1 

I 

MIVIiLLFALIi 
TPYLDLNLET 
SFPEPDLTVE 
AELVLEKPLD 
IiDSNDNVPAF 
FGIiSPRTGRL 
STVKEAVSEG 
LDREAGDSYT 
YIYAVSATDR 
PQVEARDAGS 
TRVAAVDADD 
RDHGQPPLSS 
ALGSVSFIFL 
KliSKSDIMLV 
SPSRSTDTEH 
QEADIVSSKD 
SDGRQAADYR 
LLTNTRAPYK 



11 
I 

WMVEGVFSQL 
GVLYVNEKID 
ISESATP6TR 
REQQAVHRYV 
DQPVYTVSLP 
EVSGEIiDYEE 
AAPGTWALF 
LTWARDRGE 
DEGANAQLAY 
PQAIiAGNATV 
GENARLTYSI 
TATbWQIiVD 
IiAMXVIiAVRC 
QSSNVPSNPA 
NPGGAIVTGY 
S6HGDSEQGD 
SNLHVPGMDS 
PPYLTRKRIC 



21 
I 

HYTVQBEQEH 
REQICKQSPS 
PPLESAFDPD 
LTAVDGGGGG 
ENSPPGTLVI 
SPVYQVYVQA 
SVTDRDSEEN 
PALSTSKSIQ 
SILECQIQGM 
NILIVDQNDN 
TOGNEMNLFR 
GAVEPQGGGG 
QKEKKLNIYT 
QVPIEESGGF 
TDQQPDIISN 
SDHDATNRAQ 
VPDTEVFBTP 



31 
I 

GTFVGNIAED 
CVIiHIiEVFIiB 
VGTNSLRDYB 
GVGEGGGGGG 
QLNATDPDEG 
KDLGPNAVPA 
GQVQCELLGD 
VQVSDVNDNA 
SVFTYVSINS 
APAIVAPLPG 
MDWRTGELRT 
SGGGGSGEHQ 
CLASDCCIiCC 
GSHHHNQNYC 
GSZLSNETKH 
SAGMDLFSNC 
EAQPGABRSF 



41 
I 

LGLDITKLSA 
NPIiSLFQVEI 
ITPNSYPSLD 
GAGLPPQQQR 
QNGEWYSFS 
HCKVLVRVLD 
VPFRLKSSFK 
PRFSQPVYDV 
ENGYLYAIiRS 
RNGTPAREVL 
ARRVPAKRDP 
RPSRSGGGET 
CCCGGGGSTC 
YQVCLTPESA 
QRAEIiSYIiVD 
TEECKALGHS 
STFGKEKAIiH 



51 
I 

RGFQTVPNSR 
EVIJ^INDKPP 
VQTQGDGNRF 
TGTALLTIRV 
SHZSPRAREL 
ANDNAPEISF 
NYYTIVTEAP 
YVTENNVPGA 
FDYEQLKDFS 
PRSAEPGYLIi 
QRPYELVXEV 
SLDLTLILII 
CGRQTUIARKK 
KTDLMFLKPC 
RPRRVNSSAF 
DRCWMPSFVP 
STLERKELDG 



60 

120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 158 DNA sequence 
J J Nucleic Acid Accession #: NM_02ai59.X 

Coding sequence: 70-1890 (underlined sequences correspond to start and stop codons) 



40 



45 



50 



55 



60 



65 



70 



75 



I 

GTGAAATTTA 

TATTATTGTA 

AATGATGGAA 

ATAGCTGCAA 

TTGCTACAAG 

TATATAGAAA 

GCCAAGGACA 

GTTCAAAGGG 

CTTACAAAAC 

AAGACCACAG 

TCATATAACA 

TTTCCAAAGA 

TATAAGAGTA 

TATGATAATT 

TCAAACCCAC 

GTCACAGATA 

GGCAGCTGGT 

CGCTGTAATC 

AAAGATTATA 

CTTGCCATAT 

ATTCACAAAA 

AATACAAATA 

TTAGCTGCTT 

GTCATCTACA 

6CCGTG6TAG 

TGTTGGCTTA 

ATTCTTGTTA 

GGGTTGAAAC 

GCTCTTCTGT 

TCAGTGGTTA 

TTATTCCTGT 

GTCCCCTGTT 

ACAAAAATAA 

CCAATTATTA 

AACTGTAGAT 

AATAGTTCT6 



11 
I 

AACTCCAGTC 
^TGTGTGTACC 
CCGTCTGTAT 
ATATTAATAA 
AAGTCTATAG 
TATTAGCTGA 
CCCTTTCTAA 
ATACATTTGT 
TCATGCACAC 
AGTTTGATAC 
TGAAACATAT 
GAAAAGCTGC 
TTGGTCCTTT 
CTGAAGAGGA 
CCACATTATA 
GGTATAGGAG 
CTTCAGAGGG 
ACCTGACACA 
ATATTCTTAC 
GCATTTTTAC 
ATCTTTGCTG 
CTAATAAGCT 
TTGCATGGAT 
ACAAGG6ATT 
TTGGATTTTC 
GCACCGAAAA 
ATCTCTTGGC 
CAGAAGTTAG 
TCCTTCTCGG 
CAGCTTACCT 
GTGTTTTATC 
GTTTTGGATG 
AAATTCCAAG 
ACTACTAGAC 
AATAAGGTAA 
TCAAAAATAG 



21 

I 

CTGTGGCGAA 

TGGCTTCAGA 

AGAAAATGTG 

AACTTTAACA 

AAATTCTGTG 

ATCATCTTCA 

CTCAACTCTT 

AGTTTGGGAC 

TGTTGAACAA 

AAATTCAACG 

TCATCCTCAT 

ATATGATTCA 

GCTTTCATCA 

GGAAAGAGTC 

TGAACTTGAA 

TCTATGTGCA 

CTGTGAGCTG 

TTTTGCAATT 

AAGGATCACT 

CTTCTGGTTC 

TAGCCTATTT 

CTTCTGTTCA 

GTGCATTGAA 

TTTGCACAAG 

GGCAGCACTA 

CAACTTTATT 

TTTTGGAGTC 

TTGCTTTGAG 

CACCACCTGG 

CTTCACAGTC 

TAGAAAGATT 

TTTAAG GTAA 

CTGTGGATGA 

AAAAAGTATT 

AATTATGTAT 

TATTGC3VGAT 



31 
I 

AATGCTAATT 

TCCAGCAGTA 

AATGCAAACT 

AAAATCAGAT 

ACAGATCTTT 

TTACTAGGTT 

ACTGAATTTG 

AAGTTATCTG 

GCTACTTTAA 

GATATAGCTC 

ATGAATATGG 

AATGGCAATG 

TCTGACAACT 

ATATCTTCAG 

AAAATAACAT 

TTTTGGAATT 

ACATACTCAA 

TTGATGTCCT 

CAACTAGGAA 

TTCAGTGAAA 

CTTGCTGAAC 

ATCATTGCCG 

GGCATACATC 

AATTTTTATA 

GGATACAGAT 

TGGAGTTTTA 

ATCATATACA 

AACATAAGGT 

ATCTTT66GG 

AGCAATGCTT 

CAAGAAGAAT 

ACATAGAGAA 

CCAATGTATA 

TTAAATCAGT 

CATATAGATA 

ATTTGGAAA6 



41 

I 

GCACTAACAC 

ACCAAGACAG 

GCCATTTAGA 

CCATAAAAGA 

CACCAACAGA 

ACAAGAACAA 

TAAAAACCGT 

TGAATCATAG 

GGATATCCCA 

TCAAAGTTTT 

ATGGA6ACTA 

TTGCAGTTGC 

TCTTATTGAA 

TAATTTCAGT 

TTACATTAAG 

ACTCACCTGA 

ATGAGACCCA 

CTGGTCCTTC 

TAATTATTTC 

TTCAAAGCAC 

TTGTTTTTCT 

GACTGCTACA 

TCTATCTCAT 

TCTTTGGCTA 

ATTATGGCAC 

TAGGACCAGC 

AAGTTTTTCG 

CTTGIXSCAAG 

TTCTCCAT6T 

TCCAG6GGAT 

ATTACiAGATT 

T6GTGGATAA 

AAAATGACTC 



TACTATGTTT 
TAATTGGTTT 



51 
I 

AGAAGGAAGT 

GTTTATCACT * 

TAATGTCTGT 

ACCTGTGGCT 

TATAATTACA 

CACTATCTCA 

GAATAATTTT 

GAGAACACAT 

GAGCTTCCAA 

CTTTTTTGAT 

CATAAATATA 

ATTTTTATAT 

ACCTCAAAAT 

CTCAATGAGC 

TCATCXSAAAG 

TACCATGAAT 

CACCTCATGC 

CATTGGTATT 

ACTGATTTGT 

CAGGACAACA 

TGTTGGGATC 

CTACTTCTTT 

TGTTGTGGGT 

TCTAAGCCCA 

AACCAAAGTA 

ATGCCTAATC 

TCACACTGCA 

AGGAGCCCTC 

TGTGCACGCA 

GTTCATTTTT 

GTTCAAAAAT 

TTACAACTGC 

ATCAAATTAT 

ATGCTATAGG 

TTCTATGTGA 

CTCAGGAGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2X60 



256 
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ATATCACTGC ACCCAAGGAA AGATTTTCTT TCTAACACGA GAAGTATATG AATGTCCTGA 2220 

AGGAAACCAC TGGCTTGATA TTTCTGTGAC TCGTGTTGCC TTTGAAACTA GTCCCCTACC 22 80 

ACCTCGGTAA TGAGCTCCAT TACAGTU^GT GGAACATAAG AGAATGAAGG GGCAGAATAT 2340 

CAAACAGTGA AAAGGGAATG ATAAGATGTA TTTTGAATGA ACTGTTTTTT CTGTAGACTA 2400 

GCTGAGAAAT TGTTGACATA AAATAAAGAA TTGAAGAAAC ACATTTTACC ATTTTGTGAA 2460 

TTGTTCTGAA CTTAAATGTC CACTAAAACA ACTTAGACTT CTGTTTGCTA AATCTGTTTC 2520 
TTTTTCTAAT ATTCTAAAA 

Seq ID No: 159 Protein sequence: 
Protein Accession #: NP__071442.1 



1 11 21 

I I I 

MCVPGFRSSS NQDRFITND6 TVCIENVNAN 
EVYRNSVTDI* SPTDIITYIE IIjAESSSIjIXS 
DTFWWDKLS VNHRRTHIiTK LMHTVEQATL 
MKHIHPHMNM DGDYINIFPK RKAAYDSNGN 
SEEEERVISS VISVSMSSNP PTLYELEKIT 
SSBGCEIiTYS NETHTSCRCN HLTHFAILMS 
CIPTFWFFSE IQSTRTTIHK NLCCSIjFIiAE 
FAWMCIEGIH LYLIWGVIY NKGFLHKNFY 
STENNFIWSF IGPACLIILV NLLAFGVIIY 
FIiLGTTWIFG VLHWHASW TAYLFTVSNA 
CFGCLR 



31 41 51 

I I I 

CHLDNVCIAA HINKTLTKIR SIKEFVAIiIiQ 60 

YKNNTISAKD TLSNSTLTEP VKTVNNFVQR 120 

RISQSFQKTT BFDTNSTDIA LKVFFFDSYN 180 

VAVAPLYYKS IGPLLSSSDN FLLKPQNYDN 240 

FTLSHRKVTD RYRSLCAFMN YSPDTMNGSW 300 

SGPSIGIKDY NILTRITQLG IIISLICLAI 360 

LVFLVGINTN TNKIiFCSIIA GLLHYPPLAA 420 

IFGYIiSPAW VGFSAAIiGYR YYGTTKVCWL 4 80 

KVFRHTAGLK PEVSCFENIR SCARGALALL 540 

FQGMFIFLFL CVLSRKIQEE YYRLFKNVPC 600 



Seq ID NO: 160 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-216 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TGT CTGCTTA TGCGGTGGCT CGCTGCTCAG AACAGGATGG CAGAGATGAG CACCACCATC 60 

AAAAACTCAA GGACCAGTGC T6TGGGTCCA GTCATCTGTT TCATGGAATT CACCAGTCTG 120 

GTATCTTCAA AATCCAGAAG GATGATGGCA GATGGCAGGA AGGAGGAAGA GGGTAATCTG 180 

GAAGAGTTTC CTGACCTACT CTGCTGCTGT GA TTAAA CAA CCACCAGGAA ATTTTGATGA 240 

CACTGTTCTC CTGAGCTCCT CCCTTTCCTC GGGGAAGAAA AGCATTGAAA CTACAAAAAT 300 

AAAGTGTTAT TTGGCTGGAG TGAGGTCTCA TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG 360 

AACAGGGAAC CATTGGAGAT ACTCATTACT CTTTGAAGGC TTACAGTGGA ATGAATTCAA 420 

ATACGACTTA TTTGAGGAAT TGAAGTTGAC TTTATGGAGC TGATAAGAAT CTTCTTGGAG 4 80 

AAAAAAAGAC TGGTACTTCT GAATTAACCA AAATCACAGT ATTCTGAAGA TGATTCTACA 54 0 

AAGCCTGCTG TTTCTACAAA GGCTGCTGAT GATTTCTACA AAGCCTGCTG TAGTGTTGCT 600 

GTGGCCTCTG CTTAAAAAAG TAGAAAACAC ATTGATGCAG CATGTTCACC CCAACCTCCC 660 

TGCCTAAAGG CTCAGGGACC ATCTTGGAAG AGGAAGGCGC GTGAGATTGT AAGAGCCGAA 720 

TTAGGGGGAT GGAGTGTGGA GAATAAGGAC ACTTCATCTT GGATGCTCAC CTGCCAAATT 780 

GACTTCTGAT GAAAGCCAGC TCCAGAAATG TGCCTACAGT TACTACTTTC ACCTAAACCC 840 

TGCCCTTAGT CAAATCCTTC TCTTCTTCTA AGCAATCAAC TTCAATTCCT TGTATAACCC 900 

ACAGTATAAA AGGGCTTTTA TACCATTCTA TCCTATTGCA TGTAAGCCTT GGGTCTGGGA 960 

GGTAACAGTG TGGGATTCCA CCATCTCATC TCCCTGCCAC CCAAACATGC CTGCTCTTCT 1020 
TTAAGCAATA TTAAATGTTT GTACTTCA 

Seq ID No: 161 Protein s equence; 
Protein Accession #: none found 



1 11 21 31 41 51 

I I t I I I 

CliLMRWLAAQ NRMAEMSTTI KNSRTSAVGP VICPMEFTSL VSSKSRRMMA DGRKEEEGNL 60 
EEFPDLLCCC D 

Seq ID NO: 162 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-159 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAGA CCCTCC AGAGGCAOiGG CCCAGGATTG AAGAGGGAAG CCCTGCTCCA CACGTG TTC A 60 

TCAGGAAGGA CCCACAGACT GCTGCTCCTG GAGGCCTCTC GGTTTATGGA TGTGTGTTTG 120 

TTCCATAAAC CCTCAGAGGG TCACCTGGAG ACCCG CTAAA ATGCAGGTTC TTGGGCCACA 180 

TCCTAGACCT TCTGACCGAC CCAGGGAGTG GGGCCCAGGA AGCTGCATTT GACAGATATC 240 

CCCGTGTGAT CATCATGCAC ACAGGAGTGA GAGAACCAGT GTTCTCCCCG GGCAGAAGGG 300 

AAGCTCGTGT GCAGGACACC TCACACCTCC TTTCCCATTC CCCTGCCAGG CTCTCCCTGC 360 

TGACATTGTT TTTGCGGGAG AGCTGTGAAT TCTGAAGATT AGGTTGCTTC TCACCCCAAG 420 



257 
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PCT/US02/049I5 



CTCCAGAAGT CCAGGCTGAG CCAAACCAAG CTTCAAGTTG TGCCTGGACT TGGAGAACCA 480 

GGAGGTGAGG GGACTGACTA CTTGAAGATC ACATGGAGGA GGAGTCTGAT CCAGGCCCAG 540 

GCACCAAGGA AAGGCCATGC AAGGACACAG GGAGAAGGGC AGCTGTCTGT AAGCCAGAAA 600 

GAGCCTTCAC TAGAAACCAA ATCAGCCAGA ACCTTCATCT TGGACTTTCC AGCCTTCAGA 660 
GATGTGAAAA AATAAATTTC TGTTGATTAA CCTAAAAAA 



10 



Seq ID No: 163 Protein aeouenee! 
Protein Accession #: none found 



51 



15 



20 



25 



30 



35 



40 
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1 11 2X 31 41 

I I I I I , 

ETLQRQGPGIi KREAIiLHTCS SGRTHRLLLL EASRFMDVCI. FHKPSEGHLE TR 



Seq ID NO; 164 DNA aeouence 

Nucleic Acid Accession #: NM_020241.1 

Coding sequence: 4-1557 (underlined sequences correspond to start and stop codons) 



GC CATG CAGA 
CTGGGGGG.CG 
GACTACCTGA 
GAAGGTGCTG 
GGGGACAGGG 
TACCAGAGGA 
GGCAAACAGG 
ACGCTCTTTG 
ACCCTGCAGC 
CACGCCAATG 
CTAGCCATTG 
AAACATGACT 
CATGTCTACT 
GTGTCCCX3CG 
AAGCAGTGGA 
TTCTACTTCA 
GTCCTGGCCG 
GACCTGACAC 
TCCATCTGGA 
GCCCCCGGGA 
AAGACCCACC 
CGGACCCTGA 
GGCAACCAGA 
CGGCCCAATG 
AGGGTGTGTG 
CGACGCTGGG 
CCCCCCACTC 
AGGGCCTGCC 
CGGCXSAAGGT 
GCCACCCGTC 
G 



11 
I 

CCCC6CGAGC 

CCCACGGCCT 

ACCACTATCC 

ACGACCTCAA 

ACAACCTCTA 

AGCTGACCTG 

AGGGCGAGTG 

TGTGCGGTTC 

CCGTCGGAGA 

TTGCCCTCTT 

AT6CTGTCAT 

CCAAGTG6TT 

TCTTCTTCCG 

TGGCCCGAGT 

CGTCCTTCCT 

ACGTGCTGCA 

TTTTTTCCAC 

AGGTGGCAGC 

CX3CCGGTGCC 

TGCAGTACAA 

CTCTGATGGA 

TGAGGCACCA 

CCGTTGTCTT 

CCAGCACCTC 

TCCACGAGCG 

GCTTCCAGAA 

TGCAGAGGGA 

06GAAGTCAC 

GGGTG6GGCC 

CCCTTGTGAC 



21 
I 

GTCCCCTCCC 

CTTTCCTGAG 

CGTGTTTGTG 

CATCCAGCGA 

CCGCGTAGAG 

GAGATCTAAC 

TCGAAACTTC 

CAACGCCTTC 

CAACATCAGC 

CTCTGACGGG 

CTACCGCAGC 

CAAAGAGCCT 

GGAGATTGCG 

6TGCAAGAAC 

GAAGGCGCGG 

GGCTGTCACG 

GCCCAGCAAC 

TGTGTTTGAA 

GGAGGATCAG 

TGCCTCCAGC 

CGAAGCGGTG 

GCTGACTCGA 

CCTGGGTTCT 

AGGGACGTCT 

ACGATCGTGG 

GGCCCGGGGG 

AGCGGGGACA 

ATCGGCAGCA 

CCTCTGTAAA 

CTCCCCCCTC 



31 
I 

CGCCCGGCCC 

GAGCCGCCGC 

GGCAGCGGGC 

GTCCTGCGGG 

TTGGAGCCCC 

CCCAGCGACA 

GTAAAGGTGC 

AACCCGGTGT 

GGTATGGCCC 

ATGCTCTTCA 

CTCG6GGACA 

TACTTTGTCC 

ATGGA6TTTA 

6ACX3TGGGAG 

CTCAACTGCT 

GGCGTG6TCA 

AGCATCCCTG 

GGCCGCTTCC 

GTGCCTCGAC 

GCCTTGCCGG 

CCCTCGCTGG 

GTGGCTGTGG 

GAGGCGGGGA 

GGGCGTGTGT 

TGGCCCCAGC 

CCTCCGAGGT 

ATGCCGGGGT 

GCTGTCTAAA 

TACGGCCCCA 

TGACCTCCAG 



41 
I 

TCCTGCTTCT 

CGCTTAGCGT 

CCGGACGCCT 

TCAACAGGAC 

CCACGTCCAC 

TAAACGTGTG 

TGCTCCTTCX3 

GCGCCAACTA 

GCTGCCCGTA 

CAGCTACTGT 

GGCCCACCCT 

ATGCGGTGGA 

ACTACCTGGA 

GCTCCCCCCX3 

CTGTACCCGG 

GCCTCGGGGG 

GCTCGGCTGT 

GAGAGCAGAA 

CCCGGCCCGG 

ATGACATCCT 

GCCATGCGCC 

ACGTGGGAGC 

CGGTCCTCAA 

GTCAAGTGGG 

GGCCTGGGCG 

GCCGGTTAGG 

TTCAGGCAGG 

GGGCTTGGGG 

GGGTGGTGAG 

CTGACCATGC 



51 
I 

GCTGCT6CTA 

GGCCCCCAGG 

GACCCCCGCA 

GCTGTTCATT 

GGAGCTGCGG 

TCGGATGAAG 

GGACGAGTCC 

CAGCATAGAC 

CGACCCCAAG 

TACCGACTTC 

GCGCACCX3TG 

GTGGGGCAGC 

GAAGGTGGTG 

CGTGCTGGAG 

AGACTCCCAT 

CC6GCCC6TG 

CTGCGCCTTT 

GTCCCCCGAG 

GTGCTGCGCA 

CAACTTTGTC 

CTGGATCCTG 

CGGCCCCTGG 

GTTCCTCGTC 

CCACGCGTGC 

TTGGCTGAGC 

AGTTTG^CC 

AGACACGAGG 

GCCTGGGGGG 

AGAGTCCCAT 

ATGCCACGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



55 



Seq ID No: 165 Protein sequence: 
Protein Accession #: NP 064626.1 



60 



65 



70 



75 



MQTPRASPPR 
GADDIiNIQRV 
KQEGECRNFV 
ANVAIiFSDGM 
VYFPFREIAM 
YFNVLQAVTG 
IWTPVPEDQV 
TIiMRHQLTRV 
VCVHERRSWW 



11 

1 

PALLiIiliLLIiIj 
LRVNRTLFIG 
KVLLLRDEST 
LFTATVTDFL 
EFNYLEKVW 
WSLGGRPW 
PRPRPGCCAA 
AVDVGAGPWG 
PQRPGRWLSR 



21 
I 

GGAHGLFPEE 
DRDNIiYRVEL 
IiFVCGSNAFN 
AIDAVIYRSI* 
SRVARVCKND 
lAVFSTPSNS 
PGMQYNASSA 
NQTWFLGSE 
RH6FQKAR6P 



31 

I 

PPPLSVAPRD 
BPPTSTBLRY 
PVCANYSIDT 
GDRPTLRTVK 
VGGSPRVLEK 
IPGSAVCAFD 
LPDDILNFVK 
AGTVLKFLVR 
PRCRL6V 



41 
I 

YLNHYPVPVG 
QRKIiTWRSNP 
LQPVGDNISG 
HDSKWFKEPY 
QWTSFLKARI* 
IiTQVAAVFEG 
THPIiMDEAVP 
PNAST8GTSG 



51 

1 

SGPGRLTPAE 60 

SDINVCRMKG 120 

MARCFYDPKH 180 

FVHAVEWGSH 24 0 

NCSVPGDSHP 300 

RPREQKSPES 360 

SliGHAPWlIjR 420 

RVCQVGHACR 480 



Seq ID NO: 166 DNA sequence 

Nucleic Acid Accession NM_032108.1 

Coding sequence: 39-2705 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



TCCGAGGCGT CACCTCCTCC TGTCGCCTGG CCCTCGC CAT GC AGACCCCC CGAGCGTCCC 
CTCCCCGCCC GGCCCTGCTG CTTCTGCTGC TGCTACTGGG GGGCGCCCAC GGCCTCTTTC 



60 
120 



258 



WSOCCID: <WO__02079492A2J.> 



wo 02/079492 



PCT/LS02/04915 



CTGAGGACCC GCCGCCGCTT AGCGTGGCCC 
TTGTGGGCAG CGGGCCCGGA CGCCTGACCC 
AGCGAGTCCT GCGGGTCAAC AGGACGCTGT 
TAGAGCTGGA GCCCCCCACG TCCACGGAGC 
5 CTAACCCCAG CGACATAAAC GTGTGTCGGA 
ACTTCGTAAA GGTGCTGCTC CTTCGGGACG 
CCTTCAACCC GGTGTGCGCC AACTACAGCA 
TCAGCGGTAT GGCCCGCTGC CCGTACGACC 
ACGGGATGCT CTTCACAGCT ACTGTTACCG 
10 GCAGCCTCGG GGACAGGCCC ACCCTGCGCA 
AGCCTTACTT TGTCCATGCG GTGGAGTGGG 
TTGCGATGGA GTTTAACTAC CTGGAGAAGG 
AGAACGACGT GGGAGGCTCC CCCCGCGTGC 
CGCGGCTCAA CTGCTCTGTA CCCGGAGACT 
15 TCACGGGCGT GGTCAGCCTC GGGGGCCGGC 
GCAACAGCAT CCCTGGCTCG GCTGTCTGCG 
TTGAAGGCCG CTTCCGAGAG CAGAAGTCCC 
ATCAGGTGCC TCGACCCCGG CCCGGGTGCT 
CCAGCGCCTT GCCGGATGAC ATCCTCAACT 
20 CGGTGCCCTC GCTGGGCCAT GCGCCCTGGA 
CTCGAGTG6C TGTGGACGTG GGAGCCGGCC 
GTTCTGAGGC GGGGACGGTC CTCAAGTTCC 
CGTCTGGGCT CAGT6TCTTC CTGGAGGAGT 
GGCCCGGCGG TGGCGAGACA GGGCAGCGGC 
25 GGGGCCTGCT GGCTGCCTTC CCCCGCTGCG 
AGTACTCGGG GTGTATGAAG AACTGTATCG 
CCGACGGCTC CTGCATCTTC CTCAGCCCGG 
CCGGGGCCAG CACCTCAGGC TTAGGGGACT 
AGGACCGCGC GGGGCTGGTG TCGGTGAACC 
30 TGGTGGGAGC CGTGGTGTCC GGCTTCAGCG 
GGGAGCTGGC CCGGCGCAAG GACAAGGAGG 
TGCTGAGCGT CAGCCGCCTG GGCGAGCGCA 
GCGGTGGCGG TGGCGCCGGG GTTCCCCCGG 
GCTGGGCCAA GGCCACGCTG CTGCAGGGCG 
35 CCACGCCCGA GCAGACGCCG CTGCCGCAGA 
ACGCCCTGGG CCCCCGCGCC TGGGACCACG 
CCTCCCTCCT GCTGCTGGC6 CCCGCCCGGG 
CGACCCCCGA CGGCCX3CCTC TATGCTGCCC 
CGCTCACCCC CCACGCCAGC CCGGACCGCC 
40 TGGACCCAGC CTCAGCCGCC GATGGCCTCC 
GCCTGAGGAG GCCACTGGGC CCCCACGCCC 
CGTTCAACAG CGGCGAGGCC CGGCCTGGGG 
GCACAGACTT GGCCCACCTC CTCCCCTATG 
CC TAG GCCGG GGGCCCCCCG ATGCCTTGGC 
45 ACGGTGCCAG AACGCCGGGG CCCGGGGCAA 
ACCCACCCGC GGAGTGGGGG GCCCCCTCCG 
CCCTACCCGG GGCCGCAGGA CGCTGAGACG 
CTATGGATTT GAGGTTGACC TTATGCGCGT 
CTTTTGCGGT TTTCTAACCA ATTGCACAAC 
50 GGCTTGGACG CCGGTGGGGA ATGGGGGGCC 
CTGGAAAGGT CCCTCCCCAA CCCAGGCCCC 
GCCGTGTTCG TGTGCAAGGG GCCGGGGAGG 
GCTGTGGGCG TGTGTGTCAA GTGGGCCACG 
CGTGGTGGCC CCAGCGGCCT GGGCGTTGGC 
55 GGGGGTCTCC GAGGTGCCGG TTAGGAGTTT 
GGACAAT6CC GGGGTTTCAG GCAGGAGACA 
CAGCAGCTGT CTAAAGGGCT TGGGGGCCTG 



CCAGGGACTA CCTGAACCAC TATCCCGTGT 180 

CCGCAGAAGG TGCTGACGAC CTCAACATCC 24 0 

TCATTGGGGA CAGGGACAAC CTCTACCGCG 309 

TGCGGTACCA GAGGAAGCTG ACCTGGAGAT 360 

TGAAGGGCAA ACAGGAGGGC GAGTGTCGAA 420 

AGTCCACGCT CTTTGTGTGC GGTTCCAACG 4 80 

TAGACACCCT GCAGCCCGTC GGAGACAACA 54 0 

CCAAGCACGC CAATGTTGCC CTCTTCTCTG 600 

ACTTCCTAGC CATTGATGCT GTCATCTACC 660 

CCGTGAAACA TGACTCCAAG TGGTTCAAAG 720 

GCAGCCATGT CTACTTCTTC TTCCGGGAGA 780 

TGGTGGTGTC CCGCGTGGCC CGAGTGTGCA 840 

TGGAGAAGCA GTGGACGTCC TTCCTGAAGG 900 

CCCATTTCTA CTTCAAC6TG CTGCAGGCTG 960 

CCGTGGTCCT GGCCX5TTTTT TCCACGCCCA 1020 

CCTTTGACCT GACACAGGTG GCAGCTGTGT 1080 

CCGAGTCCAT CTGGACGCCG GTGCCGGAGG 1140 

GCGCAGCCCC CGGGATGCA6 TACAATGCCT 1200 

TTGTCAAGAC CCACCCTCT6 ATGGACGAGG 1260 

TCCTGCQGAC CCTGATGAGG CACCAGCTGA 1320 

CCTGGGGCAA CCAGACCGTT GTCTTCCTGG 1380 

TCGTCCGGCC CAATGCCAGC ACCTCAGGGA 1440 

TTGAGACCTA CCGGCCGGAC AGGTGTGGAC 1500 

TGCTGAGCTT GGAGCTGGAC GCAGCTTCGG 15 60 

TGGTCCGAGT GCCTGTGGCT CGCTGCCAGC 1620 

GCAGTCAGGA CCCCTACTGC GGGTGGGCCC 16 8 0 

GCACCAGAGC CGCCTTTGAG CAGGACGTGT 174 0 

GCACAGGACT CCTGCGGGCC AGCCTCTCCG 1800 

TGCTGGTAAC GTCGTCGGTG GCGGCCTTCG 18 60 

TGGGCTGGTT CGTGGGCCTC CGTGAGCGGC 1920 

CCATCCTGGC GCACGGGGCG GGCGAGGCGG 1980 

GGGCGCAGGG TCCCGGGGGC CGGGGCGGAG 2040 

AGGCCCTGCT GGCGCCCCTG ATGCAGAACG 2100 

GGCCCCACGA CCTG6ACTCG GGGCTGCTGC 2160 

AGC6CCTGCC CACTCCGCAC CCGCACCCCC 2220 

GCCACCCCCT GCTCCCGGCC TCCGCTTCAT 2280 

CCCCCGAGCA GCCCCCCGCG CCTGGGGA6C 2340 

GGCCCGGCGG CGCCTCCCAC GGCQACTTCC 2400 

GGCGGGTGGT GTCCGCGCCC ACGGGCCCCT 2460 

CGCGGCCCTG GAGCCCGCCC CCGACGGGCA 2520 

CTCCGGCCGC CACCCTGCGC CGCACCCACA 2580 

ACCGCCACCG CG6CTGCCAC GCCCGGCCGG 2640 

GGGGGGCGGA CAGGACTGCG CCCCGCGTGC 2700 

AGTGCCAGCC ACGGGAACCA GGAGCGAGAG 2760 

CTCCGAGTGG GTGCTCAAGT CCCCCCCGCG 2820 

CCACAAGGAA GCACAACCAG CTCGCCCTCC 2880 

GTTTGGGGGT GGGTGGGCGG GAGGACTTTG 2 940 

AGGTTTTGGT TTTTTTTGCA GTTTTGGTTT 3000 

TCCGTTCTCG GGGTGGCGGC AGGCAGGGGA 3060 

ACAGCTGCAG ACCTAAGCCC TCCCCCACCC 3120 

TGGCGTGTGT GGGTGTGCGT GCGTGTGCGT 3180 

TGGGCGT6TG TGTGCGTGCC A6CGAA6GCT 3240 

CGTGCAGGGT GTGTGTCCAC GA6C6ACGAT 3300 

TGAGCCGACG CTGGGGCTTC CAGAAGGCCC 3360 

GAACCCCCCC CACTCTGCAG AGGGAAGCGG 3420 

CGAQGAGGGC CTGCCCGGAA GTCACATC6G 3480 
GGGGGCGGCG AAAG 



60 Seg ID No: 167 Protein sequence; 

Protein Accession #: NP_11S484.1 

1 11 21 31 41 51 

1 I 1 I I I 

65 MQTPRASPPR PALLLLLLtiL GGAHGLFPED PPPLSVAPRD YIiNHYPVFVG SGPGRLTPAE 60 

GADDLNIQRV LRVNRTLPIG DRDNIjYRVBIi EPPTSTELRY QRKLTWRSNP SDINVCRMKG 120 

KQEGECRNFV KVIiLLRDEST LFVCGSNAPN PVCANYSIDT LQPVGDNISG MARCPYDPKH 180 

ANVALFSDGM LFTATVTDFL AIDAVIYRSL GDRPTLRTVK HDSKWFKEPY FVHAVEWGSH 240 

VYFFFREIAM EFNYIjEKWV SRVARVCKND VGGSPRVLEK QWTSFIiKARIj NCSVPGDSHP 300 

70 YFNVLQAVTG WSIiGGRPW LAVFSTPSNS IPGSAVCAFD IiTQVAAVFBG RFREQKSPES 360 

IWTPVPEDQV PRPRPGCCAA PGMQYNASSA LPDDIIOTVK THPLMDBAVP SWSHAPWIUI 420 

TLMBHQLTRV AVDVGAGPWG NQTWFIjGSE AGTVLKFLVR PNASTSGTSG LSVFLEEFBT 480 

YRPDRCGRPG GGETGQRLIiS LELDAASGGL LAAPPRCWR VPVARCQQYS GCMKKCIGSQ 540 

DPYCGWAPDG SCIFIiSPGTR AAFEQDVSGA STSGLGDCTG IiLRASLSEDR AGLVSVNLIjV 600 

75 TSSVAAFWG AWSGFSVGW FVGUlERRBIi ARRKDKEAIL AHGAGEAVLS VSRLGBRRAQ 660 

GPGGRGGGGG GGAGVPPBAL lAPLMQNGWA KATIiLQGGPH DU>SGLLPTP EQTPIjPQKRI, 720 



259 



non7a>iQ9A9 I ^ 



wo 02/079492 



PCT/US02/04915 



PTPHPHPHAL GPRAWDHGHP LLPASASSSL LLLAPARAPE QPPAPGEPTP DGRIiYAARPG 780 
RASHGDPPLT PHASPDRRRV VSAPTGPLDP ASAADGLPRP WSPPPTGSLR RPLGPHAPPA 840 
ATLRRTHTFN SGEARPGDRH RGCHARPGTD IiAHLLPYGGA DRTAPPVP 



10 



15 



20 



Seq ID NO: 168 DNA aeouence 
Nucleic Acid Accession AW205664 

Coding sequencer 1-135 (underlined sequences correspond to start and stop codons) 



CGGCACGAGG 
CTAATTTGGA 
ACCCTCTCCG 
CTTGGAAGGC 
GTAAGCCTGA 
TGCATGGCGT 
TAAAACAGCT 



11 

I 

AGAACAGGGG 
GCACAGTCTT 
T TTAG TACCT 
GGAATGTGTT 
GTGGATCCTG 
TATGTAGATC 
GCCCTGGATG 



21 

I 

CCTCTGCCTC 
CCCGGTGCCT 
GACCACCTGT 
TTCGTGTCTT 
ACTCAGCTGC 
ACGTGCGGCA 
AAACGGAATA 



31 
I 

AGTTTGCCCG 
AGACATGCCA 
TTCAAAACGC 
CTAGGAAGGG 
AGCCCTTACC 
GAGACAGCCA 
AACCA6TGAT 



41 
I 

GGAGCCAGCC 
AGGCCCCTCC 
AGGTGTTTCT 
TCTGCTGAGG 
TGCCTCGTGC 
CTGTCCTGTG 
GCTAAAAAAA 



51 
I 

AGGGCCCATC 
CACGTGGTAC 
GGTTTAGAAA 
ACCAGACCAC 
TGATGATCTA 
TGCGGGTTTT 
AAAAAAAAAA 



60 
120 
180 
240 
300 
360 



Seq ID No: 169 Protein sequence i 
Protein Accession #: AW205664 



25 



21 



31 



41 



51 



RHEENRGLCL SLPGSQPGPI LIWSTVFPVP RHAKAPPTWY TLSV 



30 Seq ID NO: 170 DMA aeouence 

Nucleic Acid Accession AB033100 

Coding sequence: 32-2623 (underlined sequences correspond to start and stop codons) 
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60 



65 



70 



75 



1 
I 

AGGTCTGGGG 
GACGGTCTCG 
GCACTCCGTC 
CATCATCCCC 
GATCCATGAT 
TGAGCACTAC 
GGATGTCACT 
CCGGCAGGTG 
CAGGCGGGTC 
GCGGGAGGAA 
AGACAAGCAG 
CCTGGAGCTG 
CCATGTGTAC 
TGAGGACGAC 
CTACAGGTAC 
CGCCTTTGTC 
GCCTCCCCCA 
GGTCCTGGGC 
CCCCACGCAG 
CATGGTGCCC 
CGAGTTGCAT 
ACCGGAGAGC 
GAGCCTGGAG 
GCTGGCCTTT 
GCCCX3TGACG 
CCTACGGGAG 
GGCCAACTTC 
GGCCCTG6G6 
CTGGGTGAGC 
GTGGCCTGGG 
CCATCTAAGC 
CCTTACCATG 
CCGCATCCCC 
GGCCCTGCGG 
CGGCCAGGGC 
AGGCTTCCCC 
GGGTGAATTT 
GAAGGAGGTG 
CCTGCGGGAG 
AATGCGGAGG 



11 
I 

TCCTGAGGCT 
GCAGGCACCC 
AGCATCCACT 
AACAAGGTGG 
GAGCTGCTCA 
CTGGTGCAAG 
GAGAAGATGG 
CAGGGTGGGC 
CTCCA6AAAC 
MCTGTGCTTT 
AACCTTCATG 
GCCATCCGGA 
CATAACACCG 
TTGCATGTGA 
CACCGCCTGC 
AGTGTTCTCC 
GCCCTCGTCT 
ACCCTCATCC 
GCCAAGCCCC 
CAGGGAAGGA 
GACCTGAAAG 
CCAGCCCAGG 
CGATACTTCT 
GCCCTCAGTT 
CTGAGCTCAG 
GACGATCTGG 
CGGCGGGTGC 
AGCATCCTGG 
CTTCGGGAG6 
CCCCCT6TGG 
GAGCCTCCCC 
CAG6AGGTCT 
ATGCCGGACT 
GCCGCCCTCT 
CGTACCACAA 
GAGGTGGGTG 
CAGGTAGTAA 
GACGCAGCGC 
ATCATCATCT 
CTGCAGCTGC 



21 

I 

GCTGGCAGAC 
CATTTGAGGG 
CCTTCCAGAG 
CCCCTGTTGT 
AGGCTCATTA 
GAGCTCAGGC 
ATGTGCTGGG 
TCACTGTGTT 
TCCAGAAGGA 
TCCTGCGTGC 
AGAACCTCCA 
AA6AGATCCA 
A6GACCTGTG 
C66AGGAG6T 
CCCTGCCCX3A 
GGGAGACCCC 
TCAGCTGCCA 
TGCTTCACCG 
TGCCTATGGA 
GGATGGTGGA 
AAGTGGTCTT 
GAAGCGGCAG 
ACCTGATCCT 
TCAGCCGCTG 
CAGGCCCTGT 
TCTCCCCGGA 
CCCGCATGCC 
CCTACCTGAC 
AGGCCGTGTT 
CTCCTGACCA 
CAGGGAAGGA 
TCAGCCAGCA 
TCTGTGCCCC 
CCAAGGACCC 
CTGCGATGGT 
AGGAGGAGCT 
TGAAGGTGGT 
TGGACACTGT 
GCACCTACCG 
GGAGCCT6CA 



31 
I 

TAIGGGTACA 
CCTACAGGGC 
CACTAGCTTG 
GATCACGTAC 
CACGTTGGGC 
CTTACCCCAG 
CACCGTGGGA 
CGGCATGGGA 
CGGACATAGG 
AGATGAGGAC 
GGGCCTTGGA 
CGACTTTGCC 
GG6GGAGCCC 
GTACAA6CG6 
GCAAG6GAGT 
CAGCCTGCTQ 
GATGGGCGTG 
CAGTGGGACC 
GCAGTTCCAG 
AGAGGTGGAC 
GGAAAACCAG 
CCGACACAGC 
GTTTAACTAC 
GCTGTGTGCC 
GGCTCCGAGG 
CGCGCTCAGC 
CATCTACGGC 
GGACGCCAAG 
6GAGTGTGAC 
GCTGGAGACC 
GGGCCCCCTQ 
CCGCAGGGCC 
CCGAGAGGAG 
AGGCACTGGC 
GGTGGCTGTC 
CGTGAGTGTG 
GCAGCTGCTA 
CAGCGAGACC 
CCAGGCGAA6 
GTACTTGGAG 



41 
I 

ACGGCCAGCA 
AGT6GCACGA 
CATAACAGCA 
AACTGCAAGG 
CGGCTCTCGG 
GGCCGCTACT 
AGCTGTGGGG 
CAGCCCAGCC 
GAGTGTGTCA 
TTTGTGTCCT 
CCCGGGGTCC 
CAGCTGAGCG 
CATGCTGTGG 
CCCCTCTTCC 
CCCCTGGAGG 
CAGCTCCGTG 
6GCAG6ACCA 
ACCTCCCAGC 
GTGATCCAGA 
AGAGCCATCA 
AAGAAGTTAG 
GTCTGGCAGA 
TACCTTCATG 
CACCCTGAGC 
GACCTCATCG 
ACTGTCAGAG 
ACGGCCCAGC 
AGGAGGCTGC 
GGGCACACCT 
CT66AGGCCC 
ACCTACAG6T 
TGTCCTGGCC 
GACTTTGACC 
TTCGTGTTCA 
CTGGCCTTCT 
CCTGATGCCA 
CCCGATGGGC 
ATGACGCCCA 
GCAGCGAAAG 
CGCTATGTCT 



51 
I 

CAGCCCAGCA 
TGGACAGTCG 
AGGCCAAGTC 
AGGAGTTCCA 
ACAACACCCC 
TCCTGGTGCG 
CCCCCAACTT 
TCTTAGGGTT 
TCTTCTGTGT 
ACACACCTCG 
GGGTGGAGAG 
AGAACACATA 
CCATCCATGG 
TGCAGCCCAC 
CCCAGTTGGA 
ATGCCCACGG 
ACCTGGGCAT 
C AGAGG CTGC 
GCTTTCTCCG 
CTGCCTGTGC 
AAGGTATCCG 
GGGCGCTGTG 
AGCAGTACCC 
TGTACCGCCT 
CCAGGGGCTC 
AGATGGATGT 
CCAGCGCCAA 
GGAAGGTTGT 
ACAGCCTGCG 
AGCTGAAGGC 
TCCAGACCTG 
TCACCTACCA 
AGCT6CTGGA 
GCTGCCTCAG 
GGCACATCCA 
AGTTCACTAA 
ACCGTGTGAA 
TGCACTACCA 
AGGCGCAGGA 
GCCTGATTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



260 
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wo 02/079492 
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CTTCAACGCG TACCTCCACC TGGAGAAGGC CGACTCCTGG CAGAGGCCCT TCAGCACCTG 
GATGCAGGAG GTGGCATCGA AGGCTGGCAT CTACGAGATC CTTAACGAGC TGGGCTTCCC 
CGAGCTGGAG AGCGGGGAGG ACCAGCCCTT CTCCAGGCTG CGCTACCGGT GGCAGGAGCA 
GAGCTGCAGC CTCGAGCCCT CTGCCCCCGA GGACTTGCTG TAGGGGGCCT TACTCCCTGT 
5 CCCCCCACCC ACAGGGCCCC ACGCAGGCCT GGGGTGTCTG AGGTGCTCTT GGCTGGGAGC 
GGCCCTGAGG GGTGCTGGCC TTGAAATGAT TCCCCCACTT CCTGGAGAGA CTGAGCGGAG 
TTGGGAGCCT TTTTAGAAAG AACTTTTTAT AGGACAGGGA GACAGCACAG CCATCCCTTG 
CAAACCACCA AGGTGTGTGG CTGACCTCCA GGGAGGAGCA CTCACTGGAG TGCTCACAAG 
GTGCACACTG CTGTGTGTAC CTTGCAGACA GGCCGGCGTT CAGCCTCCAA GGGGCTCACT 
10 CCCCCAGTTG CCAAACACTG TGGATCTCTC TGTCCTCTTC TCCCCTCTCT CAGATTGGCC 
TGGCAGCCCC TGGCACAGAG CAGACCCGGC CACTGGTAGC TCCCCACTTC CTTACTCCTG 
CTGCTCTGCC ATTGCCGCTC CCCTTCTTGC TGCCCAAGCA CTGCCCTCGG GCGTCTGGCA 
GCCTGAGGTG GGTGGAGGGG ACAGTGTTCT GGATA6ATCT ATTATGTGAA AGGCAGCTTC 
ACCCAGTTTT CTG6ACTCTC ATGCCCCCAT CTCCGACCTG GGAGACTTCA GGAATGACAA 
15 CCTACCCAGC CTGGTGGGGC TGGCAGGAT6 GTGGAGGTTT CTCAAGGAGC TGGAGACTTC 
AGGGAGCCCC TCTCATGGGG AGGAAAGAGC TTCCAGGGG6 CGAACGCAGC ACAGAGGAAG 
AGGCCTGCTC CACTTGTCTG GGAACCTGGG CAGGAGGCAC AGA6GAAGCC AAGGCCTGGA 
GCTGCAGGTC CCCCGGCATC TCTCTCTGTC CCGGCAGCCC AGGATGGCCT GGTGCCCCCA 
CCTGCTGCAG CAGGAGCCCC AAGGAGTGCT AGCTGAGGGT GGTTGCTGGG GTGGTCCTCA 
20 TGGACAGTGA GGTGTGCAAG GGTGCACTGA GGGTGGTGGG AGGGGATCAC CTGGGTTCCA 
GGCCATCCTT GCTGAGCATC TTTGAGCCTG CCTTCCGGTG GGAGCAGAAA AGGCCAGACC 
CTGCTGAGTT AGAGGCTGCT GGGATCCACT GTTTCCACAC AGCGGGAAGG CTGCTGGGAA 
CAGGTGGCAG AGAAGTGCCA TGTTTGCGTT GAGCCTTGCA GCTCTTCCAG CTGGGGACTG 
GTGCTTGCTG AAACCCAGGA GCTGAACAGT GAGGAGGCTG TCCACCTTGC TTGGCTCACT 
25 GGGACCAGSA AAGCCTGTCT TTGGTTAGGC TCGTGTACTT CTGCAGGAAA AAAAAAAAAG 
GATGTGTCAT TGGTCATGAT ATTTGAAAAG GGGAGGAGGC CGAAGTTGTT CCCATTTATC 
CAGTATTGGA AAATATTTGA CCCCCTTGGC TGAATTCTTT TGCAGAACTA CTGTGTGTCT 
GTTCACTACC TTTTCAGGTT TATTGTTTTT ATTTTTGCAT GAATTAAGAC GTTTTAATTT 
CTTTGCAGAC AAGGTCTAGA TGCGGAGTCA GAGATGGGAC TGAATGGGGA GGGATCCTTT 
30 GTGTTCTCAT GGTTGGCTCT GACTTTCAGC TGTGTTGGGA CCACTGGCTG ATCACATCAC 
CTCTCTGCCT CAGTTTCCCC ATCTGTAAAA TGGGAGAATA ATACTTGCCT ACCTACCTCA 
CRGGGGTGTT GTGAGGATTC ATTTGTGATT TTTTTTTTTT TTTTTGTACA GAGCTTTTAA 
GCATTAAAAA CAGCTAAAT6 TG 

35 Seq ID No: 171 Protein sequence; 
Protein Accession #: BAA86588.1 

1 11 21 31 41 51 

I I I I I I 

40 MGTTASTAQQ TVSAGTPFEG LQGSGTMDSR HSVSIHSFQS TSLHNSKAKS IIPNKVAPW 
ITYNCKEEFQ IHDELIiKAHY TLGRIjSDNTP EHYLVQGAQA LPQGRYPLVR DVTEKMDVLG 
TVGSCGAPNF RQVQGGIiTVF GMGQPSLLGF RRVIiQKLQKD GHRECVIFCV REEVLFLRAD 
EDFVSYTPRD KQNLHENLQG LGPGVRVESL EIAIRKEIHD FAQLSENTYH VYHNTEDLWG 
EPHAVAIHGE DDLHVTEEVY KRPIjFLQPTY RYHRIjPIjPEQ GSPLEAQLDA FVSVIjRETPS 
45 LLQIjRDAHGP PPAIiVFSCQM GVGRTNLGMV LGTLILIiHRS GTTSQPEAAP TQAKPLPMEQ 
FQVIQSFLRM VPQGRRMVEE VDRAITACAE LHDIjKEWIjE NQKKIiEGIRP espaqgsgsr 
HSVWQRALWS LERYFYIilLF NYYLHEQYPL AFALSFSRWL CAHPEIiYRLP VTIjSSAGPVA 
PRDLIARGSL REDDLVSPDA liSTVREMDVA NFRRVPRMPI YGTAQPSAKA LGSlLAYliTD 
AKRRLRKWW VSLREEAVLE CDGHTYSLRW PGPPVAPDQIi BTLEAQIiKAH lsbpppgkeg 
50 PLTYRFQTCIi TMQEVFSQHR RACPGLTYHR IPMPDFCAPR EBDFDQLLEA LRAALSKDPG 
TGFVFSCLSG QGRTTTAMW AVLAFWHIQG FPEVGEEEIiV SVPDAKPTKG EFQWMKWQ 
LIiPDGHRVKK EVDAALDTVS ETMTPMHYHL REIIICTYRQ AKAAKEAQEM RRIiQIiRSLQY 
LERYVCIiIIiF NAYIiHIiEKAD SWQRPFSTWM QEVASKAGIY EILNELGFPE LBSGBDQPPS 
RLRYRWQEQS CSLEPSAPED liL 

55 

Seq ID NO: 172 DMA sequence 
Nucleic Acid Accession #: AK02X806.1 
Coding sequence: 1-645 (underlined sequences correspond to steurt and stop codous) 

60 

1 11 21 31 41 51 

I I I I i I 

ACTG TGCTTT TCCTGCGTGC AGATGAGGAC TTTGTGTCCT ACACACCTCG AGACAAGCAG 60 
AACCTTCATG AGAACCTCCA GGGCCTTGGA CCCX3GGGTCC GGGTGGAGAG CCTGGAGCTG 120 

65 GCCATCCGGA AAGAGATCCA CGACTTTGCC CAGCTGAGCG AGAACACATA CCATGTGTAC 180 
CATAACACCG AGGACCTGTG GGGGQAGCCC CATGCTGTGG CCATCCATGG TGAGGACGAC 240 
TTGCATGTGA CGGAGGAGGT GTACAAGCGG CCCCTCTTCC TGCAGCCCAC CTA CAGGT AC 300 
CACCGCCTGC CCCTGCCCGA GCAAGGGAGT CCCCTGGAGG CCCAGTTGGA CGCCTTTGTC 360 
AGTGTTCTCC GGGAGACCCC CA6CCTGCTG CAGCTCCGTG AT6CCCACGG GCCTCCCCCA 420 

70 GCCCTCGTCT TCA6CTGCCA QATGGGCGTG GGCAGGAGCA ACCTGGGCAT GGTCCTGGGC 480 
ACCCTCATCC TGCTTCACCG CAQTGGGACC ACCTCCCAGC CAGAGGCTGC CCCCACGCAG 540 
GCCAAGCCCC TGCCTATGGA GCAGTTCCAG GTGATCCAGA GCTTTCTCCG CATGC?rGCCC 600 
CAGGGAAGGA GGATGGTG6A AGAGGTGGAT AGATCTATTA TGTGAAAGGC AGCTTCACCC 660 
AGTTTTCTGG ACTCTCATGC CCCCATCTCC GACCTGGGAG ACTTCAGGAA TGACAACCTA 720 

75 CCCAGCCTGG TGGGGCTGGC A6GATGGTGG AGGTTTCTCA AGGAGCTGGA GACTTCAGGG 780 
AGCCCCTCTC ATGGGGAGGA AAGAGCTTCC AGGGGGCGAA CGCAGCACAG AGGAAGAGGC 840 
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CTGCTCCACT TGTCTGGGAA CCTGGGCAGG 
CAGGTCCCCC GGCATCTCTC TCTGTCCCGG 
CTGCAGCAGG AGCCCCAAGG AGTGCTAGCT 
CAGTGAGGTG TGCAAGGGTG CACTGAGGGT 
5 ATCCTTGCTG AGCATCTTTG AGCCTGCCTT 
TOAGTTAGAG GCTGCTGGGA TCCACTGTTT 
TGGCAGAGAA GTGCCATGTT TGCGTTGAGC 
TTGCTGAAAC CCAGGAGCTG AACAGTGAGG 
CCAGGAAAGC CTGTCTTTGG TTAGGCTCGT 

10 TGTCATTGGT CATGATATTT GAAAAGGGGA 
ATTGGAAAAT ATTTGACCCC CTTGGCTGAA 
ACTACCTTTT CAGGTTTATT GTTTTTATTT 
GCAGACAAGG TCTAGATGCG GAGTCAGAGA 
TCTCATGGTT GGCTCTGACT TTCAGCTGTG 

15 CTGCCTCAGT TTCCCCATCT GTAAAATGGG 
GGTGTTGTGA GGATTCATTT GTGATTTTTT 
TAAAAACA6C TAAATGTG 



AGGCACAGAG GAAGCCAAGG CCTGGAGCTG 900 

CAGCCCAGGA TGGCCTGGTG CCCCCACCTG 960 

GAGGGTGGTT GCTGGGGTGG TCCTCATGGA 1020 

GGTGGGAGGG GATCACCTGG GTTCCAGGCC 1080 

CCGGTGGGAG CAGAAAAGGC CAGACCCTGC 114 0 

CCACACAGCG GGAAGGCTGC TGGGAACAGG 1200 

CTTGCAGCTC TTCCAGCTGG GGACTGGTGC 1260 

AGGCTGTCCA CCTTGCTTGG CTCACTGGGA 1320 

GTACTTCTGC AGGAAAAAAA AAAAAGGATG 1380 

GG AGGC CGAA GTTGTTCCCA TTTATCCAGT 144 0 

TTCTTTTGCA GAACTACTGT GTGTCTGTTC 1500 

TTGCATGAAT TAAGACGTTT TAATTTCTTT 1560 

TGGGACTGAA TGGGGAGGGA TCCTTTGTGT 1620 

TTGGGACCAC TGGCTGATCA CATCACCTCT 168 0 

AGAATAATAC TTGCCTACCT ACCTCACGGG 1740 

TTTTTTTTTT TGTACAGAGC TTTTAAGCAT 1800 



20 



Seq ID No: 173 Protein sequence r 
Protein Accession #: AK021806.1 



25 1 11 21 31 41 51 

I > I I I I 

TVLFLRADED FVSYTPRDKQ NLHENLQGLG PGVRVESLEL AIRKEIHDFA QLSENTYHVY 60 
HNTEDLWGEP HAVAIHGEDD LHVTEEVYKR PLFLQPTYRY HRLPLPEQGS PLEAQIiDAFV 120 
SVLRETPSLL QLRDAHGPPP ALVFSCQMGV GRTNIiGMVLG TLIIJiHRSGT TSQPBAAPTQ 180 
JU AKPLPMEQFQ VIQSPLRMVP QGRRMVEEVD RSIM 



Seq ID NO: 174 DNA sequence 
Nucleic Acid Accession #: NM 016580.2 
35 coding sequence: 1212-4766 (Underlined sequences correspond to start and stop codons) 
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GGGAAGCGGG 
AGACCATGGG 
. TCCAGAGGTA 
AACGTCACTG 
TTTAATCCTC 
AGATGAGGTA 
AGTTGCAAAG 
CAGCTGCTTT 
ACAAGATTGG 
GAGAGTTGGG 
ACGTGCAAGC 
CCCTCAGCAG 
CCGGCCTCAT 
TTGGCATTGA 
TGCTTCTCTG 
ACTGGGTCAG 
GTTCCCCAGT 
CTTCCAATGA 
GACAAGGTTG 
AACACTGGAG 
TGGCGGTAAG 
ACTTATTTCT 
CAGAGGAAGT 
AGAGGCGGAG 
TTCAGGTGGA 
TGTGCCGACA 
CTCTGATCCA 
AAGGCGAGCA 
ACAGAGCTCT 
GTGAGCACTT 
TAGTGGTGAA 
ATGACAATGG 
CCAATGACAA 
CTGCACCTGG 
GGGAGGTGGA 
TTGATGCCAA 
CCTAOGAGGT 



11 
I 

AGGAGAGCCA 
CACCCTCATA 
GCCATAGGGT 
CCTGTGACTC 
ACAGTTTCCT 
ATGGAGGCCC 
TCAGAATTTG 
CCAGTGAGAC 
GAAAAAGACA 
GAAGGGGAGA 
AGGTGGAGGC 
CCGCCTTGCT 
CCTCCTCAGC 
AACTCACAGC 
TTTCCCCAGC 
GTCCCTCTTA 
AGGGGGTCTC 
AGATGGCCTT 
TGCCGGCAGC 
GACTTGTCCA 
CATGATGCAA 
TTTAGGGGAT 
GCCATCTGGT 
GCAAGCTGGG 
CTCTGAGGAA 
GTGGGATCCC 
TGTGGAGATC 
GGAGCTGGAA 
TGACCCAGAC 
TGCCTTGGAT 
GGAGCTGGAC 
GAACCCCCCC 
TAGCCCTGCG 
TACGCTTCTC 
GTTCTTCCTC 
GACAGGCCAG 
GGATGTTCAG 
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CACGGTCAAG 
AGTCAGTGTG 
GTGACAAGTT 
GGGGCCAGGC 
GCTGAAAGGG 
AGGAAAGTTA 
AACTCAGGCA 
AAAAACGGGT 
GGAATGAGAG 
GAAACAGGAA 
TCAAGTTTTC 
GC6TGGACAG 
GGCAGGCCAC 
CCTCCCTTTT 
CTAGCAACTG 
GACCAGCTCT 
CCCTGGCAAT 
GCCTTGGGGT 
TCTGGGGGAA 
GCCTTGAAAG 
CTTCTGCAAC 
TGTCAGGAGG 
ACAGTGATCG 
GCTGCCTTCC 
GGCTTGCTCA 
TGCCTGGTTT 
CAAGTGCTGG- 
ATCTCTGAGA 
ACAGGCCCTA 
GTCATTGTGG 
AGGGAAATCC 
AA6TCAGGTA 
TTTGCTGAGA 
ATAAAACTGA 
AGTAAGCACA 
GTCATTCTGC 
GCAAGGGACC 



31 
I 

TTGCACAGGT 
GGCAGGGACT 
GTGCAGATTA 
CCAGGCCAAA 
CTACTATTCT 
AGTGACTTGT 
GTTTACCTCT 
GATCAGGGCA 
GGGAACAATG 
ACATGACTTG 
TGCTCACTTG 
CAGCTTCCCA 
TTAGCTTCAC 
CCT6TAG6TG 
TTTGGCAGTC 
TGTCCATCAT 
TCTTGATCGG 
CCTGCTTGTT 
GGAGCACGGG 
AACTCTAGTG 
TTCTGCTGGG 
TGACCACTCT 
GGAAGCTGTC 
AGGTGTTGCA 
GCACAGGCAG 
CCTTTGATGT 
ACATCAATGA 
GCX5CCTCTCT 
ACACCCTGCA 
GCCCTGATGA 
ATTCATTTTT 
CCAGCTTGGT 
GTTCACT6GC 
CC6CCACAGA 
TGCCTCCAGA 
GTCGACCTCT 
TGGGTCCX^A 



41 

I 

TCTTGCAGCT 
GCCCCAGGGC 
CAACACTCAC 
GCCCTTCCTA 
TACTCCCATC 
CCCAGATGAC 
GATGGCTGCT 
GAGTCAAGAC 
GGGGAAAAGA 
CCCGGGAGGG 
GTGATGCAGA 
TCTGGCCTGT 
AGGAAATGCT 
GGGTTTCCAT 
AGAGTCCCAC 
TTGCTGAAGT 
CGTTTGGACA 
TCATAATCAT 
GCTGATCAA6 
GTTTCTGAAT 
GCTTTTGGGG 
CACGGTGAAA 
CCAGGAACTG 
GCTGCCTCAG 
GCGGCTGGAT 
GCTTGCCACA 
CCACCAGCCA 
GCGAACCCGG 
CACCTACACT 
GACCAAACAT 
TGATCTGGTG 
CAAGGTCAAC 
ACTG6AAATC 
CCCTGACCAA 
G6TGCT6GAC 
AGACTAT6AA 
TCCTATCCCA 



51 



TCTG6AATCA 
CAATCCAAGA 
CCCTTGCAAT 
CATCATTTCG 
CCCACTCTAC 
ACCGCTGGTA 
CTGTTAATCA 
AGAGAGGTAA 
TAGGAACAAA 
GCATCAGTCC 
GGCTCCCTTT 
CCCCGGAGCC 
CTTTCTCTAA 
AGGAAAAAGC 
ATCCTGCTCA 
GGACCAACTA 
TCTCAGATCG 
CTAACTATGG 
CCATCCAGGA 
CTA6CCCACT 
CCAGGTGGCT 
TACCAAGTGT 
GGCCGGGAGG 
GCGCTCCCCA 
CGAGAGCAGC 
GGGGATTTGG 
CGGTTTCCCA 
ATCCCCCTGG 
CTGTCTCCCA 
GCAGAACTCA 
TTAACTGCCT 
GTCTTGGACT 
CAAGAAGATG 
GGCCCCAATG 
ACCTTCAGTA 
AAGAACCCTG 
GCCCATTGCA 
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AAGTTCTCAT CAAGGTTCTG GATGTCAAT6 ACAACATCCC AAGCATCCAC GTCACATGGG 
CCTCCCAGCC ATCACTGGTG TCAGAAGCTC TTCCCAAGGA CAGTTTTATT GCTCTTGTCA 
TGGCAGATGA CTTGGATTCA GGACACAATG GTTTGGTCCA CTGCTGGCTG AGCCAAGAGC 
TGGGCCACTT CAGGCTGAAA AGAACTAATG GCAACACATA CATGTTGCTA ACCAATGCCA 
5 CACTGGACAG AGAGCAGTGG CCCAAATATA CCCTCACTCT GTTAGCCCAA GACCAAGGAC 
TCCAGCCCTT ATCAGCCAAG AAACAGCTCA GCATTCAGAT CAGTGACATC AACGACAATG 
CACCTGTGTT TGAGAAAAGC AGGTATGAAG TCTCCACGCG GGAAAACAAC TTACCCTCTC 
TTCACCTCAT TACCATCAAG GCTCATGATG CAGACTTGGG CATTAATGGA AAAGTCTCAT 
ACCGCATCCA GGACTCCCCA GTTGCTCACT TAGTAGCTAT TGACTCCAAC ACAGGAGAGG 
10 TCACTGCTCA GAGGTCACTG AACTATGAAG AGATGGCCGG CTTTGAGTTC CAGGTGATCG 
CAGAGGACAG CGGGCAACCC ATGCTTGCAT CCAGTGTCTC TGTGTGGGTC AGCCTCTTGG 
ATGCCAATGA TAATGCCCCA GAGGTGGTCC AGCCTGTGCT CAGCGATGGA AAAGCCAGCC 
TCTCCGTGCT TGTGAATGCC TCCACAGGCC ACCTGCTGGT GCCCATCGAG ACTCCCAAT6 
GCTTGGGCCC AGCGGGCACT GACACACCTC CACTGGCCAC TCACAGCTCC CGGCCATTCC 
15 TTTTGACAAC CATTGTGGCA AGAGATGCAG ACTCGGGGGC AAATGGA6AQ CCCCTCTACA 
GCATCCGCAG TGGAAATGAA GCCCACCTCT TCATCCTCAA CCCTCATACG GGGCAGCTGT 
TCGTCAAT6T CACCAATGCC AGCA6CCTCA TTGGGAGTGA GTGGGAGCTG GAGATAGTAG 
TAGAGGACCA G6GAAGCCCC CCCTTACAGA CCCGAGCCCT GTTGAGGGTC ATGTTTGTCA 
CCAGTGTGGA CCACCTGAGG GACTCAGCCC GCAAGCCTGG GGCCTTGAGC ATGTCGATGC 
20 TGACGGTGAT CTGCCTGGCT GTACTGTTGG GCATCTTCGG GTTGATCCTG GCTTTGTTCA 
TGTCCATCTG CCGGACAGAA AAGAAGGACA ACAGGGCCTA CAACTGTCGG GAGGCCGAGT 
CCACCTACCG CCAGCAGCCC AA6AGGCCCC AGAAACACAT TCAGAAGGCA GACATCCACC 
TCGTGCCTGT GCTCAGGGGT CAGGCAGGTG AGCCTTGTGA AGTCGGGCAG TCCCACAAAG 
ATGTGGACAA GGAGGCGATG ATGGAAGCAG GCTGGGACCC CTGCCTGCAG GCCCCCTTCC 
25 ACCTCACCCC GACCCTGTAC AGGACGCTGC GTAATCMGG CAACCAGGGA GCACCGGCGG 
AGAGCCGAGA GGTGCTGCAA GACACGGTCA ACCTCCTTTT CAACCATCCC AGGCAGAGGA 
ATGCCTCCCG GGAGAACCTG AACCTTCCCG JiGCCCCAGCC TGCCACAGGC CAGCCACGTT 
CCAGGCCTCT GAAGGTTGCA GGCAGCCCCA CAGGGAGGCT GGCTGGAGAC CAGGGCAGTG 
AGGAAGCCCC ACAGAGGCCA CCAGCCTCCT CTGCAACCCT GAGACGGCAG CGACATCTCA 
30 ATGGCAAAGT GTCCCCTGAG AAAGAATCAG GGCCCCGTCA GATCCTGCGG AGCCTGGTCC 
GGCTGTCTGT GGCTGCCTTC GCCGAGCGGA ACCCCGTGGA GGAGCTCACT GTGGATTCTC 
CTCCTGTTCA GCAAATCTCC CAGCTGCTGT CCTTGCTGCA TCAGGGCCAA TTCCAGCCCA 
AACCAAACCA CCGAGGAAAT AAGTACTTGG CCAAGCCAGG AGGCAGCAGG AGTGCAATCC 
CAGACACAGA TGGCCCAAGT GCAAGGGCTG GAGGCCAGAC A6ACCCAGAA CAGGAGGAAG 
35 GGCCTTTGGA TCCTGAAGAG GACCTCTCTG TGAAGCAACT GCTAGAAGAA GAGCTGTCAA 
GTCTGCTGGA CCCCAGCACA GOTCTGGCCC TGGACCGGCT GAGCGCCCCT GACCCGGCCT 
GGATGGCGAG ACTCTCTTTG CCCCTCACCA CCAACTACCG TGACAATGTG ATCTCCCCGG 
ATGCTGCAGC CACG6AGGAG CCAAG6ACCT TCCAGACGTT CGGCAAGGCA GAGGCACCAG 
AGCTGAGCCC AACAGGCACG AGGCTGGCCA GCACCTTTGT CTCGGAGATG AGCTCACTGC 
40 TGGAGATGCT GCTGGAACAG CGCTCCAGCA TGCCCGTGGA GGCCGCCTCC GAGGCGCTGC 
GGCGGCTCTC GGTCTGCGGG AGGACCCTCA GTTTAGACTT GGCCACCAGT GCAGCCTCAG 
GCATGAAAGT GCAAGGGGAC CCAGGTGGAA AGACGGGGAC TGAGGGCAAG AGCAGAGGCA 
GCAGCAGCAG CAGCAGGTGC CTG TGAA CAT ACCTCAGACG CCTCTGGATC CAAGAACCAG 
GGGCCTGAGG ATCTGTGGAC AAGAGCTGGT TTCTAAAATC TTGTAACTCA CTAGCTAGCG 
45 GCGGCCTGAG AACTTTAGGG TGACTGATGC TACCCCCACA GAGGAGGCAA GAGCCCCAGG 
ACTAACAGCT GACTGACCAA AGCAGCCCCT TGTAAGCAGC TCTGAGTCTT TTGGAGGACA 
GGGACGGTTT GTGGCTGAGA TAAGTGTTTC CTGGCAAAAC ATATGTGGAG CACAAAGGGT 
CAGTCCTCTG GCAGAACAGA TGCCACGGAG TATCACAGGC AGGAAAGGGT GGCCTTCTTG 
GGTAGCAGGA GTCAGGGGGC TGTACCCTGG GGGTGCCAGG AAATGCTCTC TGACCTATCA 
50 ATAAAGGAAA AGCAGTGATT CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID No: 175 Protein aeauence: 
Protein Accession #: NP_0S7664.1 



1 11 21 31 41 51 

I I I I I I 

MMQLLQIiLIjG LLGPGGYLPL IjGDCQEVTTL tvkyqvsebv PSGTVIGKLS QELGREERRR 60 

QAGAAFQVIxQ IiPQALPIQVD SEEGI*IjSTGR RIiDREQLCRQ WDPCLVSFDV LATGDLALIH 120 

60 VEXQVIjDIND HQPRFPKGEQ EIiEISESASL RTRIPIiDRAL DPDTGPNTLH TYTLSPSEHF 180 

ALDVIVGPDE TKHAELIWK ELDREIHSFF DLVLTAYDNG NPPKSGTSLV KVNVIiDSNDN 240 

SPAFAESSLA liEIQEDAAPG TLLIKLTATD PDQGPNGEVE FFLSKHMPPE VIjDTFSIDAK 300 

TGQVIIiRRPIi DYEKNPAYEV DVQARDLGPN PIPAHCKVLI KVLDVNDNIP SIHVTWASQP 360 

SLVSEALPKD SFIALVMADD LiDSGHNGIiVH CWLSQEIiGHF RUCRTNGNTY MLLTNATliDR 420 

65 EQWPKYTLTIi LAQDQGLQPL SAKKQIjSIQI SDINDNAPVF EKSRYBVSTR ENNLPSLHLI 4 80 

TIKAHDADLG INGKVSYRIQ DSPVAHLVAI DSNTGEVTAQ RSLNYEEMAG FEFQVIAEDS 540 

GQPMLiASSVS VWVSLLDAND NAPEWQPVL SDGKASIjSVL VNASTGHLLV PXBTPNGLGP 600 

AGTDTPPLAT HSSRPPLLTT IVARDADSGA NGEPIiYSIRS GNEAHLFILN PHTGQLFVNV 660 

TNASSLIGSE WELEIWBDQ GSPPIjQTRAL LRVMFVTSVD HLRDSARKPG ALSMSMIjTVI 720 

70 CliAVIiLGlFG LILALPMSIC RTEKKDNRAY NCRBABSTYR QQPKRPQKHI QKADIHLVPV 780 

LRGQAGEPCE VGQSHKDVDK EAMMEAGWDP CLQAPFHIiTP TLYRTLRNQG NQ6APAESRE 840 

VLQDTVOTjLF NHPRQRNASR ENLNIjPEPQP ATGQPRSRPL KVAGSPTGRI. AGDQGSBEAP 900 

QRPPASSATIi RRQRHIiNGKV SPEKESGPRQ ILRSLVRLSV AAFAERNPVE ELTVDSPPVQ 960 

QlSQLhSXAM QGQFQPKPNH RGNKYIiAKPG GSRSAIPDTD GPSARAGGQT DPEQEEGPIiD 1020 

75 PEEDIjSVKQL LEEELSSIjLD PSTGIALDRL SAPDPAWMAR LSIiPIiTTNYR DNVISPDAAA 1080 

TEEPRTFQTF GKAEAPELSP TGTRIASTPV SBMSSLIiEML LEQRSSMPVE AASBAIiRRI*S 1140 
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VCGRTLSLDL ATSAASGMKV QGDPGGKTGT EGKSRGSSSS SRCL 

Seq ID NO: 176 DMA sequence 

Nucleic Acid Accession U: AIj109712.1 

coding sequence: 2-128 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

in ' ' I I I I 

lU GAG2CTCTTT GGGCCAGCOG GGCTGCTGCA GACAGACAGG AAGCACGCCT GACGCTCCTC 60 

TACCCTCG6G CAGCACAGCG GGGCTGQGAC TCACTCTAGC TTGCCCAGCA ACTTGCTTTC 120 

CTGTGTGAAC TCTGGCAGGC TGCCCTCTCT GTGCAAAGCT GCCACTGGGG CCTGCTCAGG 180 

GTGGCCTGGA ACTTGGAGGT GG6CAGTCAG GGCCTAGGAT GGGCCTGTGT CACCAGGGCA 24 0 

TGTGCCCTTG GGCCAGTTAC TTCCTCTCAG AGCCTTGGGC TCCTCCTCTG AGGATCGGGC 3 00 

13 TTGTTGGTGT GAAATGAGGT GAGCATGTTG AGTTGGGGAG CAGCAGGACA CGCACCTGCA 360 

GGCAGCCGCC CTGGCCACGC TCCCTCCCTA CCTTCCGAGT CCTGGGACA6 ACACAGTAGA 420 

GCACAGCGGG CCAGCCTGCT CTCTTCTCTG TCTACTTTTT GCAGAAGAGT CAACAGATAC 480 

AACAGGCCCA GGGAGGTGCC CCTGGGGGCC CCAGTCCCCA TCACTCCAAG GGGCAGTCCT 540 

on GCAAGTGACA AGGTGGGCCC AATCCCTGTG GAACAGGTCT CTGAGGACCA CAGAGTG6GG 600 

ZU CCCCAGGGAA AGCTGGGAGC CGAGCTAGAG GCAGGCAGCA AGTAAGGGCA AAGCTGTGCC 660 

CCTGCCCGGA AGACCTTCCT GCCCCCAGAA CCCGACCCTC CGCAGATAGC CCTCCCTGGG 720 

CAGCAGCCCC CCAGCTTCCA AGGCCCGTGC CTCACCAGAC GCCATGCTCT CACGGACTTG 780 

TTTGCTGCTC TGTACCCTGC AGATCTGCCC CAGAGGAGCA GGTGAAAAGC CGCGCCTGCC 840 

GAGGTGCTGT GGCGGTGGAG TTTTGGGCAG AGGAGTGGGG GGAAGAGTTT CTCACTTTTA 900 

ZD AGATTCTCCA AATCCAAGAT GAAGTCATGC TGTGCTTTGG AATGGTAGAT GCTCATTTAT 960 
GTAAAATCAT AATAAATGTT ACACAAACTG TTAAAAAAAA AAAAAAAAAA AAAAAA 



30 
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Seq ID No: 177 Protein sequence: 
Protein Accession #: AL109712.1 

1 11 21 31 41 51 

' < I I i I 

VSLGQPGCCR QTGSTPDAPL PSGSTAGLGI* TLACPATCFP V 



35 Seq ID NO: 178 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 3-107 (underlined sequences correspond to start and stop codons) 

40 1 11 21 31 41 51 

' > I I I I 

AATGGAGCAC TCCAAAGAAC GATTTGACCA ATAGCATTTC TTCTCTGGGG GTTGTATTTC 60 

AAAGCATGCA ACTCTCCAGG GAACCAGAAC TAAATTGCTT AAAATGAAGT CATTCCTCAG 120 

ATTAACTTCC TCAGATAAAG TGTCAGCGGT CTGCAGAAAC GAAGAAGACA AAACTGAGAT 180 

TATCACTCAT AATTCTCTTA CTTACTATGT CAGTGAAACA ATGAGTTTGC ATTTTTGCAA 240 

TCCTAGAACA TTCTTCATTA GCCCTGGGTC ATGACCTCTT CCAGTTAATT CTCTTTCACA 300 

CCTTTAGGAA AGATTTAAGA T6AACCTTCA ATAGGATATT AACATAACTC ATAGCCAATA 360 

CCACAGCTGC CTTTCAAATT AATGAGGTTA ATTCTTCTCC AGCAAACATG AGTTTGTCTT 420 

TGGCATTTTA AATGCTTCCC ATT6ATCTGA CATTTTGCTG TTTCAAGTTT TAAAGGGCTC 480 

DKJ AAATCAAAGA CTATTGATAA CTGAGCAAAG AGCGAAGATC CAGAAATACG AAAACATTGT 54 0 

CTTTTTTTTT CCATGAAAAA CAATCATAGC CTTTTGAATT CAATCGAAGT TTCTACATTA 600 

GCCATCTAAG ACTTATTTAA TTATTTCTGT TCTCAGTCAA GCTAATTCAA GT6AATGAAC 660 

AGTATTGACT TTTAAAATCT TTTTTAAATT TTTTTAAATC TTTAGTTTAT TAAGTTTGTA 720 

GAAAAGCTCT GGGGCCATGA CCACTTACGT AAATGTTTCA 6TTTAAAAAC AAAAGATTCA 780 

DO GGCCTCTAAT TTGAGCCAAA TCCAGGTGAT CTTGTTTGAA ATTTTTGATG AATTTGAAAA 840 

GATGAAAGTG GAACTTTTAA CATTCATGTT CCCCAAATTT TTCACTGGGA AGGGATGCTA 900 

ATTGCCTACT TAAGATATAA GTTCAAGAAT AACATTTTCA TAGAAAATTC AGAAAACTGC 960 

TTGACACAGC AGTGACATAG TTAGATGTGG CTCAGATGCC TTCCAAACCT GAGGGTCCCC 1020 

AAAGATTTCT TTACCAGTTG TTTTTAACTA TGAATCTTAA TCTTGTTCAT TCCCCTGCCA 1080 

OU AAACAAATTT AAAAG 



Seq ID No: 179 Protein sequencer 
Protein Accession #: none found 



1 11 21 31 41 51 

I I i I I I 

WSTPKNDLTN SISSLGWFQ SMQLSRBPEIi NCLK 



70 Seq ID NO: 180 DNA aemience 

Nucleic Acid Accession #: none found 

coding sequence: 2-176 (underlined sequences correspond to start and stop codons) 



75 



11 21 31 41 51 



264 



NSDOCID: <WO. 02079492A2.I^> 



wo 02/079492 



PCT/IJS02/04915 



CCGGGTGGGG CCTCGGGATG CAGGCGCCGG TGCCCGGGCC CCTGGGCCTG CTGGACCCCG 60 

CAGAAGGGCT TTCGAGGAGG AAGAAGACGT CGCTCTGGTT TGTGGGGTCT CTGCTGCTGG 120 

TGTCCGTCCT CATAGTCACC GTCGGGCTGG CTGCATCAGC AGGACGGAGA ATGTGACCGT 180 

TGGGGGCTAC TACCCAGGGA TCATTCTCGG CTTTGGATCT TTCTTAGGAA TTATTGGCAT 240 

CAACTTGGTG GAGAATAGAA GGCAAATGCT GGTGGCAGCG ATCGTGTTTA TCAGTTTTGG 300 

CGTG6TGGCC GCCTTCTGCT GCGCCATCGT GGACGGCGTA TTTGCAGCAC AGCACATTGA 360 

ACCGAGGCCC CTCACCACGG GAAGATGCCA GTTTTACTCC AGTGGGGTGG GGTACTTGTA 420 

CGATGTCTAC CAGACAGAGG TGAGCAGGAG CACTGAGATT CATGTGGGTT TTGCTCAGCT 480 

AACCCCGCCG ACCCCACGCG GTTTTCCCTG CACATAGGCG TGGTCTGAAT ATTTGGATTC 540 

TAATAGTTCC TGGGGGTCAC CCCTGCAGCT GGTGAACCGT TGATGCCCCC TGTGTAAGGG 600 

ACCTTGACAT TTCGATGTGC TGTATTTCAC TCTGGAGTCA GAGTTCTGGA CTTGCTTCAT 660 

TAAATCACAA CAGTCTCAGA AAACAACCGC ACCACCCCGC AATCCCACCA AAGGGGCGCG 720 
CCGTCCCTAA GAGTTATCCC 

Seq ID No: 181 Protein sequence,; 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

RVGPRDAGAG ARAPGPAGPR RRAFEEEEDV AIjVCGVSAAG VRPHSHRRAG CISRTENV 

Seq ID NO: 182 DNA sequence 

Nucleic Acid Accession #: AK001S79.1 

Coding sequence: 1150-2637 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TTTTCTCTGC TTTTCGCTAC CCCGGTCACT CTCATTTCTC tCCCCTATTC CTrGTCTCTT 60 

CCCCCATCCC CCTTTCTCCT GTCCTCCCCC TGCCTCTACA GTGGTTCTCC CCGCTGAGCT 120 

GCCACCAGCT GCTGGGCCCC GGGCTGCTGC GGCTGGGCCG CCTATGGCTG CGGTCCCCCT 180 

CCCATACAGC CCCGGCCCCT GGTCTCTGGC TGTCAGGGTT TGGCCTCCTT CGTGGTGACC 240 

ACCTCTTCCT GTGCTCAGCG CCGQGCCCAG GCCCCCCAGC CCCTGAGGAC ATGGTGCATC 300 

TGCGGCGGCT ACAGGAGATC AGT6TGGTTT CTGCAGCTGA CACCCCAGAT AAGAAAGAGC 360 

ATTTGGTCCT GGTGGAGACA GGAAGGACCC TGTATCTGCA AGGAGAGGGC CGGCTGGACT 420 

TCACGGCATG GAACGCAGCC ATTGGGGGCG OGGCTGGTGG GGGCGGCACA GGGCTGCAGG 480 

AGCAGCAGAT GAGCCGGGGT GACATCCCCA TCATCGTGGA TGCCTGCATC AGTTTTGTTA 54 0 

CCCAGCATGG GCTCCGGCTG GAAGGTGTAT ACCGGAAAGG GGGCGCTCGT GCCCGCAGCC 600 

TGAGACTCCT GGCTGAGTTC CGTCGGGATG CCCGGTCGGT GAAGCTCCGA CCAGGGGAGC 66 0 

ACTTTGTGGA GGATGTCACT GACACACTCA AACGCTTCTT TCGTGAGCTC GATGACCCTG 720 

TGACCTCTGC ACGGTTGCTG CCTCGCTGGA GGGAGGCTGC TGGTATTCCT AAGATCCCTG 780 

AGAGCCAAGG CCCAACCAGG ATCTCTGCCT TCCCCCACCA GAATCCATGG TTTGGCAGCC 840 

CTCCGCCCCA TCACTTCCCA CCCTGGGGGA TCATCCAGAG ACTTGGCTCA GGGGGAGGTG 900 

GGAAGGGGGC AGAGACACAT CCATCCTGCA TTTGTGCCTA AAAATCCCTC CCTCTGTACC 960 

AGCTGCCACT CTTTCTTCCC GGGTCCTCCC CAACCCTCCT CCATTCCATC CCCAGAGCTG 1020 

CCCCAGAAGA ATCAGCGCCT GGAGAAATAT AAAGATGTGA TTGGCTGCCT GCCGCGGGTC 1080 

AACCGCCGCA CACTGGCCAC CCTCATTGGG CATCTCTATC GGGTGCAGAA ATGTGCGGCT 1140 

CTAAACCAG A TGT GCACGCG GAACTTGGCT CTGCTGTTTG CACCCAGC6T GTTCCAGACG 1200 

GATGGGCGAG GGGAGCACGA GGTGCGAGTG CTGCAAGAGC TCATTGATGG CTACATCTCT 1260 

GTCTTTGATA TCGATTCTGA CCAGGTAGCT CAGATTGACT TGGAGGTCAG TCTTATCACC 1320 

ACCTGGAAGG ACGTGCAGCT GTCTCAGGCT GGAGACCTCA TCATGGAA6T TTATATAGAG 1380 

CAGCA6CTCC CAGACAACTG TGTCACCCTG AAGGTGTCCC CAACCCTGAC TGCTGAGGAG 1440 

CTGACTAACC AGGTACTGGA GATGCGGGGG ACAGCAGCTG GGATGGACTT GTGGGTGACT 1500 

TTTGAGATTC GCGAGCATGG GGAGCTGGAG CGGCXSICTGC ATCCCAAGGA AAAGGTCTTA 1560 

GA6CAGGCTT TACAATGGTG CCAGCTCCCA GAGCCCTGCT CAGCTTCCCT GCTCTTGAAA 1620 

AAAGTCCCCC TGGCCCAAGC TGGCTGCCTC TTCACAGGTA TCCGACGTGA GAGCCCACGG 1680 

GTGGGGCTGT TGCGGTGTCG TGAGGAGCCA CCTCGCTTGC TGGGAAGCCG CTTCCAGGAG 1740 

AGGTTCTTTC TGCTGCGTGG CCGCTGCCTG CTGCTGCTCA AGGAGAAGAA AAGCTCTAAA 1800 

CCAGAACX3GG AGTGGCCTTT GGAAGGTGCC AAGGTCTACC TGGGAATCCG CAAGAAGTTA 1860 

AAGCCCCCAA CACCGTGGGG CTTCACATTG ATACTAGAGA AGATGCACCT CTACTTGTCC 1920 

TGCACTGACG AGGATGAAAT GTGGGATTGG ACCACCAGCA TCCTTAAAGC CCAGCACGAT 1980 

GACCAGCAGC CAGTGGTCTT ACGACGCCAT TCCTCCTCTG ACCTTGCCCG TCAGAAGTTT 204 0 

GGCACTATGC CTTTGCTGCC TATCCGTGGG GATGACAGTG GAGCCACCCT CCTCTCTGCC 2100 

AATCAGACCC TGCGGCGACT ACACAACCGG AGGACCCTGT CCATGTTCTT TCCAATGAAG 2160 

TCATCCCAGG GGTCTGTGGA GGAGCAAGAG GAGCTGGAGG AGCCTGTGTA CGAGGAGCCA 2220 

GTGTATGAGG AAGTAGGGGC CTTCCCTGAG TTGATCCAGG ACACTTCTAC CTCCTTCTCC 2280 

ACCACACGGG AGTGGACAGT GAAGCCAGAQ AACCXTCCTCA CCAGCCAGAA GTCATTGGAT 2340 

CAACCCTTTC TCTCCAAGTC AAGCACCCTT GGCCAGGAGG AGAGGCCACC TGAGCCCCCT 2400 

CCAGGCCCCC CTTCAAAGAG CAGTCCCCAG GCACGGGGGT CCCTAGAGGA ACAGCTGCTC 2460 

CAGGAGCTCA GCAGCCTCAT CCTGAGGAAA GGAGAGACCA CTGCAGGCCT GGGAAGTCCT 2520 

TCCCAGCCAT CCAGCCCCCA ATCCCCCAGC CCCACTGGCC TTCCAACACA GACACCTGGC 2580 

TTCCCCACCC AACCCCCATG CACTTCXa^GT CCACCCTCCA GCCAGCCCCT CACATGACCC 2640 

TAGGACCAGC AGTCTGAGAG GGTAGGTACC AGAAGACCCA GAAACTCTTA TCGTGGCACT 2700 

GTTGCAGCTT CCTCTGCCCT GGCTGGAAAG ACTCCAGAAT CCAGTGTGGT GCTGTGGAAG 2760 

GAGCACTGGA CTAAAGGCTT CAGTGGCTGC GTGTCCCAGG ACAGGTCATG GCCCCTCTCT 2820 

GGGCCCAGCC CATTTATCTA TACCATGAGG TAACTGAAGT AAGGA6AGCA GTGAATGTCA 2880 



265 



wo 02/079492 



PCT/US02/(M9I5 



AACTGTGTTT 
TTTATATATT 
ACTGCATTAA 
CCTGTCCCAC 
GGGAGCTGGG 



CTTAGAGCCA 
TGATACGTAG 
AGAAAAAAAA 
TTCCTACATT 
ACTCX5GGGAG 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



TAAGCCCCAC ATATTATCCC TGAACAAGGG CAGCTCCTGC 2940 

GGGTTCCATG AGAGATTTTG GGTTTTAAAG GAATGGTTTT 3000 

TGCTTTGGAA ACCAGAGGCC TGGGTGATGT TAAAGTCTAT 3060 

CTGGGACTAC CGTGAAGCCT GGAGTAGGGA GAGCGAGTTT 3120 
TCAAAAATAG ATGAGTAATT GTCAATAAAC CTGGGAACC 



Seq ID No: 183 Protein aecniencer 
Protein Accession #: AK001579.1 



1 
I 

MSLTHSNASF 
SHPGGSSRDL 
RLEKYKDVIG 
HEVRVLQELI 
NCVTIjKVSPT 
WCQIiPEPCSA 
RGRCIiLLLKE 
EMWDWTTSIL 
RLHNRRTIiSM 
TVKPENPLTS 
LILRKGETTA 



11 
I 

VSSMTLPIiHG 
AQGEVGRGQR 
CLPRVNRRTL 
DGYISVFDID 
LTAEBLTNQV 
SLLLKiCVPLA 
KKSSKPEREW 
KAQHDDQQPV 
FFPMKSSQGS 
QKSIiDQPFLS 
GliGSPSQPSS 



21 
I 

CCLAGGRLLV 
HIHPAPVPKN 
ATLIGHLYRV 
SDQVAQXDLE 
LEMRGTAAGM 
QAGCLFTGIR 
PLEGAKVYLG 
VLRRHSSSDL 
VEEQEELEEP 
KSSTLGQEER 
PQSPSPTGLP 



31 
I 

FLRSLRAKAQ 
PSLCTSCHSF 
QKCAALNQMC 
VSLITTWKDV 
DLWVTFEIRB 
RESPRVGLLR 
IRKKLKPPTP 
ARQKFGTMPL 
VYEEPVYEEV 
PPEPPPGPPS 
TCyrPGFPTQP 



41 
I 

PGSLPSPTRI 
FPGPPQPSSI 
TRNLAZ^FAP 
QLSQAGDIilM 
HGELERPLHP 
CREEPPRLLG 
WGFTIiILEKM 
LPIRGDDSGA 
GAFPELIQDT 
KSSPQARGSL 
PCTSSPPSSQ 



51 

I 

HGLAALRPIT 
PSPELPQKNQ 
SVFQTDGRGE 
BVYIEQQLPD 
KEKVLEQAIjQ 
SRFQERFFLIi 
HLYMCTDED 
TLLSANQTLR 
STSFSTTREW 
EEQLLQEIjSS 
PLT 



60 
120 
IBO 
240 
300 
360 
420 
460 
540 ' 
600 



Seq ID NO: 184 DMA sequence 

Nucleic Acid Accession #: none found 

coding sequence: 1-81 (underlined sequences correspond to start and stop codons) 



1 
I 

GTAGAGTTAG 
GTATTAAGCT 
AATCTCAACA 
CTTTCTCTAT 
AACCCTAACT 
CACTGCACCA 
ACTGAAGATG 
AACATGTAAT 
TGTTCAGCTG 
AAGTTTTGTT 
ACAAAGGCAT 
GGATGGCAGT 
TGTGCCTCTT 
CCATTAGCTA 
GTCATGTGCT 
TTGCTTG6TT 
ACTTGGATGC 
TTCTTTTGTC 
CAGGATGGTT 
AGTAAGCTTT 
TGGCTTTTCA 
GCTTATTCTT 
CACCTCCTTG 
GCACTTAAAA 
TGCAGGTTAG 
CACCGTTCTA 
GACACAGTCA 
TTTTGCAGAA 



11 21 

I I 
TGTCAATGTG CTTAGAATAT 
TAAAAAG TTA AT TCAGTTTA 
TAAGAAGTCA AAATGTAATG 
AATTTCATCA GTATGTCCTC 
CTGCTAATTA TAAGCTA6GC 
GCTTTGTTAT CTGTAAAATG 
AGAGAACATG ATATGTGTAA 
GAATGTAGTA ATAGTAATTA 
TAACAGAATA CCCAAAATAA 
ACGAATTCAG ACAATCCAGG 
CTTTCCTGAT TTCTGCCAGT 
TCCAGCCCTG GTTACGCCCA 
CACTTTAATC ATAGCTCCCA 
ATGCTTGCTT ACATGGTCAC 
CAATTAATAT CCAAGTGTCC 
GCATTCTTCT TAGCATAAGC 
CTCAGTTGTC CTTTCATTTA 
ATCAGCACCA TCACTACCAC 
GCAACAACCA CCATAGGGAC 
TGAAAATGTA GGTCAGATCA 
TATTACTCAA AAGAAAACXTT 
CCTCTTACTT TATCTCTGTA 
CTGCTTGTCC TATACTCCTA 
TAGAAAAAAA AAAAAAAAAA 
AAGTCATGGA GCTGGGATCT 
TTCAACCCCA TTGCCTAGAG 
GCTCTTTCTG AGAAAAGGCA 
GAAAAC 



31 
I 

ACCAAATTCA 
AGGAATATAA 
CT6CCAGATA 
TCCCTTTTCT 
AAGTAATCTT 
ATGATAATAC 
AGTGCCTTCC 
TTTTATTTTC 
CAGTTTTAAA 
GCTTTTATAG 
CTCAATGCAT 
TATTAGCACA 
CTAGATGCAC 
ACTTAGTTTC 
AATTACTGAG 
CACATTCTTT 
GAAATGCTCC 
TGCCTTCTTC 
TTTTTGCTTC 
TGTCTCTCTC 
AAAACTTTGC 
TTGCTCTTCC 
AAAGAAGTTC 
AGCTCA6A6A 
AAATCCATGT 
GTGCTTGATT 
GCTCAGCATT 



41 
I 

TAAACATTTT 
ACCAAATTAT 
ACAATATCAA 
CCTATTTGTC 
GGACAAGTTA 
CAACACCTTC 
ACAATACCCA 
TTTTGATTCA 
CAAATTAAAG 
ATGCACCAGG 
GGGTTGCAAT 
CAGAAAGAAA 
CCACTACTTC 
CAGAGAGACA 
AAAAAAAGAA 
TTATGAAGTT 
TTGGACATCC 
AAAGCCACCA 
TACTTCCACA 
TTCTCTTCAA 
TGTGAGATCT 
TCACTCTACT 
AGTCTTCCCT 
GGCTGAGTTG 
CAGTCTGACT 
6CTCAATAAT 
TCCATGAGAT 



51 

I 

CTCTAAAAAA 

TTTATATTTG 

AGGTATTTTT 

AAATTTTAGC 

TTTGACCTCT 

TTCTTGGGGT 

GAACATAGCA 

GTTGGGACTA 

TTTTGTTGTG 

ATCAGCAGGT 

CCAGAGTCCA 

GAGAAAGGGA 

TGCTGATACT 

TGTCTGGACA 

ACTAGCACCT 

GTCCTCAGTT 

TGAATCTGAC 

CGTTCTGTCC 

CAATAGCCAG 

AACCCTCCGA 

ATGTGACCCG 

CCAGCCATCC 

TATGATATTT 

TCCAAGGTCA 

ATGAGTTCTG 

AGATTCCATG 

CCGCACATCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



Seq ID No: 185 Protein aequencf*- 
Protein Accession # : none found 



65 ^ 11 21 31 41 51 

' > I i I I 

VELVSMCLEY TKFINIFSKK VLSLKS 

Seq ID NO: 186 DNA seoueng^ 
/U Nucleic Acid Accession NM_002203.2 

coding sequence: 43-3588 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

7<; I I I I I i 

/O CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGftlSGGGCC AGAACGGACA 60 
GGGGCCGCGC CGCTGCCX3CT GCTGCTGGTG TTAGC6CTCA GTCAAGGCAT TTTAAATTGT 120 



266 



MSOOCtD: <WO___02079402A2J.> 



wo 02/079492 



PCT/US02/04915 



TGTTTGGCCT ACAATGTTGG TCTCCCAGAA 
CAGTTTGGGT ATGCAGTGCA GCAGTTTATA 
TCACCCTGGA GTGGCTTTCC TGAGAACCGA 
CTATCCACTG CCACATGTGA AAAACTAAAT 
5 ACTGAGATGA AAACCAACAT GAGCCTCGGC 
GGTTTTCTCA CATGTGGTCC TCTGTGGGCA 
GGTGTGTGTT CTGACATCAG TCCTGATTTT 
CAGCCCTGCC CTTCCCTCAT AGATGTTGTG 
CCTTGGGATG CAGTAAAGAA TTTTTTGGAA 
10 ACAAAGACAC AGGTGGGGTT AATTCAGTAT 
AACACATATA T^AACCAAAGA AGAAATGATT 
GGGGACCTCA CAAACACATT CGGAGCAATT 
GCTTCTGGTG GGCGACGAAG TGCTACGAAA 
CATGATGGTT CAATGTTGAA AGCTGTGATT 
15 TTTGGCATAG CAGTTCTTGG GTACTTAAAC 
AAAGAAATAA AAGCGATCGC TAGTATTCCA 
GAAGCAGCTC TACTAGAAAA GGCTGGGACA 
ACTGTTCAAG GAGGAGACAA CTTTCAGATG 
TACTCTTCTC AAAATGATAT TCTGATGCTG 
20 ACCATTGTCC AGAAGACATC TCATGGCCAT 
ATTCTGCAGG ACAGAAATCA CAGTTCATAT 
GGAGAAAGCA CTCACTTTGT TGCTGGTGCT 
CTATATAGTG TGAATGAGAA TGGCAATATC 
ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT 
25 GACGTGCTCT TGGTAGGTGC ACCAATGTAC 
GTCTACCTGT TTACTATCAA AAAGGGCATT 
GAGGGCATTG AAAACACTCG ATTTGGTTCA 
GATGGCTTTA ATGATGTGAT TGTTGGTTCA 
TACATTTACA ATGGTCATCA GGGCACTATC 
30 TCCGATGGAG CCTTTAGGAG CCATCTCCAG 
GATTTAAATG GGGATTCCAT CACCGATGTG 
CTCTGGTCAC AAAGTATTGC TGATGTAGCT 
ACTTTGGTCA ACAAGAATGC TCAGATT^TT 
CCTACTAAGC AAAACAATCA AGTGGCCATT 
35 TTTTCATCCA GAGTAACCTC CAGGGGGTTA 
AAGAATATGG TAGTAAATCA AGCACAGAGT 
CCCTCTGATG TTGTCAACTC TTTGGATTTG 
ACTAGCCCTG CCCTTGAAGC CTATTCTGAG 
AAAGACTGTG GTGAGGATGG ACTTTGCATT 
40 CCAGCTGCTC AAGAACAACC CTTTATTGTC 
GTAACACTGA AAAATAAAAG GGAAAGTGCA 
GAAAACTTGT TTTTTGCATC ATTCTCCCTA 
GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC 
CAACAGGTGA CTTTTACTAT TAACTTTGAC 
45 TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC 
AACCTCAAAA TTCCTCTCCT GTATGATGCT 
AATTTTTATG AAATCTCTTC GGATGGGAAT 
GTTGGTCCAA AATTCATCTT CTCCCTGAAG 
GCAACTGTAA TCATCCACAT CCCTCAGTAT 
50 ACTGGGGTGC AAACAGACAA GGCTGGTGAC 
AAAATAGGAC AAACATCTTC TTCTGTATCT 
GAATTGAACT GCAGAACTGC TTCCTGTAGT 
ATGAAAGGAG AATACTTTGT TAATGTGACT 
TCAACX3TTCC AGACAGTACA GCTAACGGCA 
55 ATATATGTGA TTGAAGATAA CACT6TTACG 
AAAGCCGAA6 TACCAACAGG AGTTATAATA 
TTAGCTCTGG TTGCAATTTT ATGGAAGCTC 
ACCAAAAATC CAGATGAGAT TGATGAGACC 
ACCTGCAGTG GGAACCGGCA GCATCCCAGC 
60 TTTTTAAATC CCATATTTTT TTTATCATGT 
AAAACTGCAG GTCAGTTTGG ATGAAGAAAT 
GTAGGGAAAT AATAGGGAAA ATACCTATTT 
AACTGGCTGG CCCAGAGTTT ACATTCTAAT 
CAAGCATGAC AACTTTTAAA GAAAAATATG 
65 TCTCTTTAAA ATATTTGTCT TTAAACAGCA 
GTACTTCCAC TTGTGTATAT TTTAATGAAT 
ACAGGTTTTT TCAATTTATG CTGCTCATCC 
TAATTTTATT TATAAACTAG GTAAAATTTG 
CTTCCACACC CCATCTTGCT CTAATGATCA 
70 TACCTCCTAT ATGTCCATTT AAGTTAGGAG 
TTTTGTTTAA AACTCAGAAT ATAACAT TTA 
TGTGCCAGAG 6AAGGAAAAG GAGGAAATTT 
CTTCTAGGAT TTGTTTGGCT GACTGGCAGT 
TTCTTTGGCA ACCTTCCTCC TCCCTT ACTG 
75 TATTATAGAA GCCCTCTACA GCCTGACTTT 
TTACCCCTCA TCCAAAGTTC CCACTCCTTC 



GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 
AATCCAAAAG GCAACTGGTT ACTGGTTGGT 24 0 
ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 
TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 
TTGATCCTCA CCAGGAACAT GGGAACTGGA 42 0 
CAGCAATGTG GGAATCAGTA TTACACAACG 4 80 
CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 54 0 
GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 
AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 
GCCAATAATC CAAGAGTTGT GTTTAACTTG 72 0 
GTAGCAACAT CCCAGACATC CCAATATGGT 780 
CAATATGCAA GAAAATATGC CTATTCAGCA 840 
GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 
GATCAATGCA ACCAT6ACAA TATACTGAGG 960 
AGAAACGCCC TTGATACTAA AAATTTAATA 1020 
ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 
TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 
GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 
6GTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 
TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 
TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 
CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440 
ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 
TCAGTTGATG TGGATAAAGA CACCATTACA 1560 
ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 
TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 
GCAATTGCAG CTCTTTCAGA CATCAACATG 174 0 
CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 
CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 
TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 
TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 
ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 
CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 
GTATATAACA TCACACTTGA TGCAGATGGA 2160 
TTTAAAGAAA ACAAT6AAAG 6TGCCTGCAG 2220 
TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 
CX3T6TGGACA TCAGTCTGGA AAACCCTGGC 2340 
ACT6CCAAGG TCTTCAGTAT TCCTTTCCAC 2400 
TCTGATCTAG TCCTAGATGT CCGACAAATA 24 60 
AGCAACCAAA ACAAAAGGTT AACATTTTCA 2 520 
TACAACACTG GAATTGTTGT TGATTTTTCA 2580 
CCGGTTGATG GGACAGAAGT AACATGCCAG 2640 
GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 
TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 
CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2 820 
GAAATTCACT TAACAAGATC TACCAACATA 288 0 
GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 
GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 
ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 
ATCAGTTGTA ATGCAGATAT CAATCCACT6 3120 
TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 
AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 
ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 
GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 
ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 
GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 34 80 
GGCTTCTTCA AAAGAAAATA TGAAAAGATG 354 0 
ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 
CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 
CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 
TGTGGGGGGT GGGGGAGGTG CGGGG GGCA G 37 80 
TATATGATGG GGGAAAAAAA GTAATCTTTA 3 840 
TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 
ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 
ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 
ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 
AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 
TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 
AAACATGCTT GAATAACTGA GCTTAGA6TA 4260 
AGGGGGCGAT ATAGA6ACTA AGGCACAAAA 4320 
TGTAAAATCC CATCTGCTAG AA6CCCATCC 4380 
CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 
AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 
AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 
CTCTCCA6CG GTCCAAAGTT ATCCCCTCCT 4620 
AGGACAGCT6 CTGTGCATTA GATATTAGGG 4680 
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10 



GGGAAAGTCA 
TTCAGGGAGC 
CTAAATGITC 
CCAGAAGTTA 
GCTGTCTTGT 
GTTCAAAAGG 
TTTTAACAAG 
ATTTCTACTT 
CTTAGATTAA 
AAACTTCAGA 
TTTAGTTTTA 
CAAGAATTTG 



TCTGTTTAAT 
TATTTTCATT 
GAAT6TTATG 
CAGATGAGGC 
TTCTGAAGTA 
TAGATCCTGA 
CACCCCAGTC 
TTTGCACCTT 
AATTCACAGA 
GTCCTCATTA 
AAAGTCTATG 
ACTTGGAAAA 



TTACACACTT 
TAGTGCTAAA 
6GATGTAAAC 
ACT6GAAACC 
CTTTTTCTTC 
GATGATTTGG 
ACTAGGATGC 
ATTTTCTCTG 
CACTACATAT 
TAAAATGGGA 
ATCT6ATCTG 
G 



GCATGAATTA 
CAAGTAAGAA 
AATGTAAAGT 
ACCACCAAAT 
CACAAGAGT6 
TCAGATTGGG 
AGATGGACCA 
TTCCTGAGCC 
CTAAAGCTTT 
AGACTGAGCT 
GACTTCCTAT 



CTGTATATAA 
AAATAAGCTA 
AAAACACTCT 
TAGCAGGTGC 
AATTTGACCT 
ATAAG6CCCA 
CACTTTGAGA 
CCCACATTCT 
GACAAGTCCT 
GGAGTTCAGC 
AATACAAATA 



ACTCCTTAAC 
GAGTGAATTT 
CAGGATTTCA 
ACCTTCTGTG 
AGGCAAGTTT 
GCAATCTGCA 
AACACCACCC 
CTAGGAGAAA 
TGACCTCTAT 
AGTGATGCTT 
CACAATCCTC 
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4600 
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4920 
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5040 
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5340 



Seq ID NO: 187 Protein aemience- 
1j Protein Accession #: NP 002194.1 
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MGPERTGAAP 

NWXiLVGSPWS 

RNMGTGGFLT 

ESNSIYPWDA 

QTSQYGGDLT 

HDNIIjRFGIA 

IPSIEGTVQG 

KQAFDQILQD 

AHRGDQIGSY 

QFIiEGPEGIE 

SQKILGSDGA 

FTPEKITLVN 

NERCLQKNMV 

FSIPPHKDCG 

IWDFSENIiF 

NLQKQASIiSF 

VHSFEDVGPK 

ADINPLKIGQ 

NGTFASSTFQ 

AGIZjIiLLALV 



11 

I 

LPLLLVLALS 

GFPENRMGDV 

CGPIiWAQQCXS 

VKNFIiEKPVQ 

NTFGAIQYAR 

VLGYLNRNAL 

GDNFQMEMSQ 

RNHSSYLGYS 

FGSVLCSVDV 

NTRFGSAIAA 

FRSHLQYFGR 

KNAQIILKLC 

VNQAQSCPEH 

EDGLCISDLV 

FASFSLPVDG 

QAXiSESQEEN 

FIFSLKVTTG 

TSSSVSPKSE 

TVQLTAAAEI 

AILWKIiGFFK 



21 
I 

QGILNCCIiAY 
YKCPVDLSTA 
MQYYTTGVCS 
GLDIGPTKTQ 
KYAYSAASGG 
DTKHLIKEIK 
VGFSADYSSQ 
VAAISTGBST 
DKDTITDVIiI* 
LSDINMD6FM 
SLDGYGDLNG 
FSAKFRPTKQ 
IIYIQEPSDV 
LDVRQIPAAQ 
TEVTCQVAAS 
KADNLVNLKI 
SVPVSMATVX 
NFRHTKELNC 
NTYNPEIYVI 
RKYEKMTKNP 



31 
I 

NVGLPEAKIP 

TCEKLNLQTS 

DISPDFQLSA 

VGLIQYANNP 

RRSATKVMW 

AIASIPTERY 

NDILMLGAVG 

HFVAGAPRAN 

VGAPMYMSDL 

OVIVGSPIjEN 

DSITDVSIGA 

NNQVAIVYNI 

VNSLDLRVDI 

EQPFIVSNQN 

QKSVACDVGY 

PLLYDAEIHL 

IHIPQYTKEK 

RTASCSNVTC 

EDNTVTIPIiM 

DEIDETTEIjS 



41 

I 

SGPSSEQFGY 

TSIPNVTEMK 

SFSPATQPCP 

RWFNLNTYK 

VTDGESHDGS 

FPNVSDEAAL 

AFGWSGTIVQ 

YTGQIVLYSV 

KKSEGRVYZiF 

QNSGAVYIYN 

FGQWQLWSQ 

TZiDADGFSSR 

SLENP6TSPA 

KRLTPSVTIjK 

PALKREQQVT 

TRSTNINFYE 

NPLMYLTGVQ 

WliKDVHMKGE 

IMKPDEKABV 



51 
I 

AVQQFINPKG 

TNMSLGLILT 

SLIDWWCD 

TKEEMIVATS 

MLKAVIDQCN 

LEKAGTLGEQ 

KTSHGHLIFP 

NENGNITVIQ 

TIKKGILGQH 

GHQGTIRTKY 

SZADVAIEAS 

VTSRGIiFKEN 

LEAYSETAKV 

NKRESAYNTG 

PTINFDPNLQ 

ISSDGNVPSI 

TDKAGDISCN 

YFVNVTTRIW 

PTGVIIGSII 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 188 DWA aemiene^ 

Nucleic Acid Accession #: NM_002210.1 

coding sequence. 42-318B (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



GGCTACCGCT 
GACGGCTGCG 
TGTGCCGCGC 
GTTACTTCGG 
TCGTGGGAGC 
TCAAATGTGA 
ATAGAGATTA 
CTGTGAGGTC 
A6ATGAAACA 
TTGAGTATGC 
6AGGATTCAG 
TTTATTGGCA 
CCAATGTTTA 
TTGATGACAG 
ATGACTTTGT 
ATGGGAAGAA 
GATTTTCTGT 
CACCTCTCTT 
TGTCTCTACA 
TTGCACGGTT 
ATATTGCAAT 
ATGGAAGATC 
CTCGAAGCAT 
ATGGATATCC 
CCAGACCAGT 
ACAATAAAAC 
TCTGCTTAAA 
TTCTTTTGGA 
GGTCCCCAAG 
AATTGATAGC 
TTTTTATGGA 



CCCGGCTTGG 
CCTCGGTCCC 
CTTCAACCTA 
CTTCGCCGTG 
TCCCAAAGCA 
CTGGTCTTCT 
TGCCAAGGAT 
GAAACAGGAT 
GGAGCXIAGAG 
TCCATGTAGA 
CATTGATTTT 
AGGTCAGCTT 
CAGCATCAA6 
CTATTTGGGT 
TTCAGGAGTT 
CATGTCCTCC 
AGCTGCCACT 
CATGGATCGT 
GAGAGCTTCA 
TGGCAGTGCC 
TGCTGCTCCA 
AACAGGCTTG 
GCCACCAAGC 
AGACTTAATT 
TATCACTGTA 
CTGCTCACTG 
GGCAGATGGC 
TAAACTCAAG 
TCACTCCAAG 
GTATCTGCGG 
ATATCGGTTG 



CGTCCCGCGC 
CGCGGCCTCC 
GACGTGGACA 
GATTTCTTCG 
AACACCACCC 
ACCCGCCGGT 
GATCCATTGG 
AAAATTTTGG 
CCTGTTGGAA 
TCACAAGATA 
ACTAAAGCTG 
ATTTCGGATC 
TATAATAACC 
TATTCTGTGG 
CCAAGAGCAG 
TTATACAATT 
GACATTAATG 
GGCTCTGATG 
GGA6ACTTCC 
ATAGCTCCTT 
TATGGGGGTG 
AACGCAGTCC 
TTTGGCTATT 
GTAGGAGCTT 
AATGCTGGTC 
CCTGGAACAG 
AAAGGAGTAC 
CAAAAGGGAG 
AACATGACTA 
GATGAATCTG 
GATTATAGAA 



GCACTTCGGC 
CGCTTCTTCT 
GTCCTGCC6A 
TGCCCAGCGC 
AGCCTGGGAT 
GCCAGCCAAT 
AATTTAAGTC 
CCTGTGCCCC 
CATGCTTTCT 
TTGATGCTGA 
ACAGAGTACT 
AAGTGGCAGA 
AATTAGCAAC 
CTGTCGGAGA 
CAAGGACTTT 
TTACTGGCGA 
GAGATGATTA 
GCAAACTCCA 
AGACGACAAA 
TGGGAGATCT 
AAGATAAAAA 
CATCTCAAAT 
CAATQAAAGG 
TTGGTGTAGA 
TTGAAGTGTA 
CTCTCAAAGT 
TTCCCAGGAA 
CAATTCGACG 
TTTCAAGGGG 
AATTTAGAGA 
CAGCTGCTGA 



I 



GATGGCTTTT 
CTCGG6ACTC 
GTACTCTGGC 
GTCTTCCCGG 
TGTGGAAGGA 
TGAATTTGAT 
CCATCAGTGG 
ATTGTACCAT 
TCAAGATGGA 
TGGACAGGGA 
TCTTGGTGGT 
AATCGTATCT 
TCGGACTGCA 
TTTCAATGGT 
GGGAATGGTT 
GCAGATGGCT 
TGCAGATGTG 
AGA6GTGGGG 
GCTGAATGGA 
GGACCAGGAT 
AGGAATTGTT 
CCTTGAAGG6 
AGCCACAGAT 
TCGAGCTATC 
CCCTAGCATT 
TTCCTGTTTT 
ACTTAATTTC 
AGCACTGTTT 
GGGACTGATG 
CAAACTCACT 
TACAACAGGC 



51 
I 

CC6CCGCGGC 
CTGCTACCTC 
CCCGAGGGAA 
ATGTTTCTTC 
GG6CAGGTCC 
GCAACAGGCA 
TTTGGAGCAT 
TGGAGAACTG 
ACAAAGACTG 
TTTTGTCAAG 
CCTGGTAGCT 
AAATACGACC 
CAAGCTATTT 
GATGGCATAG 
TATATTTATG 
GCATATTTCG 
TTTATTGGAG 
CAGGTCTCAG 
TTTGAGGTCT 
GGTTTCAATG 
TATATCTTCA 
CAGTGGGCTG 
ATAGACAAAA 
TTATACAGGG 
TTAAATCAAG 
AATGTTAG6T 
CAGGTGGAAC 
CTCTACAGCA 
CAGTGTGAGG 
CCAATTACTA 
TTGCAACCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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960 
1020 
1080 
1140 

1200 

1260 

1320 ■ 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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GCCAGGTGGT 
TTCGTTTCAG 
TCC3^AAGCTC 



TTCTTAACCA GTTCACGCCT GCTAACATTA GTCGACAGGC TCACATTCTA CTTGRCTGTG 

gSgacaa ?gtctgtaaa cccaagctgg aagtttctgt agatagtgat caaargaa^ 

TCTATATT^ GGATGACAAC CCTCTGACAT TGATTGTTAA GGCTCAGAAT CAAGGAGAAO 

gSc™ga agctgagctc atcgtttcca ttccactgca ggctgatttc atcggggttc 

?ccS^ TgSgCCTTA GCAAGACTTT CCTGTGCATT TAAGACAGRA ARCCftAACTC 
TCCGAAACAA TGAAGCCTTA ^^^^^^^ TGAAGGCTC6 AACTCAACTC TTAGCTQGTC 

tgtgcaccag cagtcagaga tggatacttc tgtgaaattt sacttacaar 

TCCAAAGL-K. AAATCTATTT GACAAAGTAA QCCCAGTrGT ATCTCACAAA GTTGATCTTG 

2tg™gc ^Ig^^g ATAAGAGGAG tctcgagtoc ^tcatatc ™^c^^ 

TTPPATVACTG GGAGCACAAG GASAACCCTG AGACTGAAGA AGATGTTGGG CCAGTTGTTC 
Igca^Sa ?gagctga^ AACAATGGTC CAAGTTCATT CAGCAAGGCA ATGCTCCATC 
TTCAOTG^C ™C^TM AATAATAACA CTCTGTTGTA TATCCTTCAT TATGATATTG 
iScCA^T GAACTGCACT TCAGATATGG A6ATCAACCC TTTGAGAATT AAGATCTCAT 
CT^S^C ^TGAAAAG AATGACACGG TTGCCGGGCA AGGTGAGCGG GACCATCTCA 
S?^^ ^ATCTTGCC CTCAGTGAAG GAGATATTCA CACTTTGGGT TGTGGAGTTG 
^^Stcctt GAAGATTGTC TGCCAAGTTG GGAGATTAGA CAGAGGAAAG AGTGCAATCT 
TOtSaAA CraTTACTG TGGACTGAGA CTTTTATGAA TAAAGAAAAT CAGAATCATT 
ScG?CT GCTTCATTTA ATGTCATAGA GTTTCCTTAT AAGAATCTTC 

cSttgagga tatcaccaac tccacattgg ttaccactaa tgtcacctgg ggcattcagc 
^S^^AT gcctgtgcct gtgtgggtga tcattttagc agttctagca ggattgttgc 
tS^? ™g?at^ gtaatgtaca ggatgggctt ttttaaacgg gtccggccac 
ctS^gmga acaagaaagg gagcagcttc aacctcatga aaatggtgaa ggaaactcag 

^OTA^CT GCAGTTTTTA AGTTATGCTA CATCTTGACC CACTAGAATT AGCAACTTTA 
™I^TT AAACTTTCTT CATGAGGAGT AAAAATCCAA GGCTTTACTG CTGATAGTGC 



^Chkcllk ^olZ^l^k ?ATrrGTCAA CCTTCTCCTT ATAAATAAGT 
?^gIcATAC ATTTAATAAC ATAGGGTGAC TTGTGTTTTT AGGTATTTAA ATM.TAAAAT 

^Sgggat aotttttatt caatgtatat aagacaggta gtgcctgatt tactacttta 
?I^Stag tacctccttc agttactgtt tctgatttaa tgtacggaac ™tttgtt 
GTT^?^G ttgttgttgt tgttgtttta aagcagtcca aatttggacc ttagcaatca 

MAGGTACTT AATGTTAATA CATATTACAC TACAGTTTAC TTTTCAGAAT 

IS^CT ^aStgc atgaacttgg atttttttaa tcactcatat ggtagaattt 
tat^caS tacatgatac catccaaatt cttgctttta ataacaaagg tacaatattt 
tootag™ tgaaaaSctg gtagatccta ttacacttct gtttatatta aatccacaat 
S?Ittac atttttaact tgtataaatt ttaggtcaaa tccttcaagc caacctatac 

tI^TCAG ?TCCATAATC ACAAATGGCT CTTTTGTGTA ATrGTTTAAT TTCACCTGAA 

otaaIgcca tatggagttg gaaattattt ccaaagcata tttattccat 

?GmMTC TGGCTATTTA CAGTATAAAA AAAGCATTTT ATTAAAATAC TGTGTAGTTC 

Smatag ttgcttatgc atatagtaag tattacattc ttagagtaga gcagagtttt 
Stagtat ta^tttattt tcctccattc atgtactttt ccttatattt ccaaaactgt 
?ac?gSt ScLgat cagtgagaaa tctttacagt tgacaggaac ctggacccct 
?ACCCC^ ttatgagtaa tgcttggaat aaaaaactct taaggcaact cactgattta 

C^CTAG^ TAGCATGATG TTACAGGAAT ATTACCTCTG TTTAAGCAAG GTAATGTGTA 

^?Sg?^ togctgtcag aataacttct aaaaggtatt tttataagca gttcaagtta 
^Iaaacct tttaaacctt tctgaagttc gttagtataa attacttttc ^g^ttatt 
^t^gcc acataggtgg caagttgtag ttttatatgg ctctgtagag tgctqaacct 

TCT^GAA TATATGATTT ATTCACAGTT CCTCAAGGCC TGGGGATGAT GATCAGTTAT 
IcCTA^T GTGCAATTAC ATCATCTTGT ACATTAGAAA TGGAGAGTTT AATAGCTCTT 
tSgSgt CCTCATTAGG TAATGATAAA TATTTCCCTT AAATAATTGA CTATTTTGCT 

GTOmSAA amSttgaa atttatcttg ccatatctca taattocatg cacaagttga 
ctcS^ ^tgaSaXta tattcgtaaa ataggagcac atttagttga ggtatacaag 
gS^™ macaa^cc ttctatttta gctttagtga atttcaaaag taatgggtct 

?S^ATA6 ATTTTTATTA GTAGCTTGAA AGAGCTTAAT CATATGCAGT AAGTATTTTT 
AOTACCAATA AATTTAAAAT TTTTTAAGAA AAATATTTTT ATCCTAGGGC CAAGTGTTGC 
S^^^ TCACTAAGTT AGTCTATAAC AAATTTTACC CTAACAGTTT TACCACCTAG 
^^T^ TOCTGAAAAT ATGTTGGATA GAAAGTCACT CTTTGGCAAA AGTGTTAGAA 
^^d^l TCCTTTTATG GCATCTATCT TGAAAGTAAT CTTGTATTGG 
IcATTGAAAG ATGCTGTAAT TTAGAAATTA ACATGATATC TTAAATTACC TTTATGAAAT 
ATAG?™? Al^AGCAT AGATTTTCCT TCAAAAAATG AACATTTATA T ATCTACA AA 
SSga AG^™? GAAAGCCTAC TnCTGAAGA AAATGGMGG ATTTTTTTTT 

^Stgat?a aatatcaaaa aattgcccta tgaaaacttt aaatctctaa aacatttgaa 
atScSt atttgtgatt tattgagaat aaaaatccat tttgaaatgt aaaattto 
SctSatt cagttttaag aaaacatgaa tgaactagaa gatattaaaa A^TTT^a 

??^TAAGAA ATATTGATAC TGATATTGAT TTITATATAG 6TJj™;TTT ^AGAATTGAT 

attttgagaa aaatacatgt gagtcatttt ttctgtttct cttttctctt aacgattatc 
actgtaattc tgaatct 
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Seg ID NO: 189 y^otein sequence; 
Protein Accession #: NP_002201.1 



1 

ilAPPPRRRliR 
SSFMFLLVGA 
HQWFGASVRS 

gqgfcqggfs 
rtaqaipdds 



11 
I 

lgprglplll 
pkanttqpgi 

KQDKILiACAP 
IDPTKADRVL 
YLGYSVAVGD 



21 

1 

SGIoLLPLCRA 
VEGGQVLKCD 
LYHWRTEMKQ 
LGGPGSFVWQ 
PNGDGIDDFV 



31 



41 

I 



51 



FNLDVDSPAE YSGPEGSYFG FAVDFFVPSA 
WSSTRRCQPI EFDATGNRDY AKDDPIiEFKS 
EREPVGTCFL QDGTKTVEYA PCRSQDIDAD 
GQIiISDQVAE IVSKYDPNVY SIKYNNQIAT 
SGVPRAARTL GMVYIVDGKN MSSliYNFTGE 
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QMAAYPGFSV 
LNGPEVFARP 
LEGQWAARSN 
PSILNQDNKT 
ALFIiYSRSPS 
TTGLQPIIiNQ 
AQNQGEGAYE 
TQLIiAGLRFS 
DHIFLPIPNW 
IliHYDIDGPM 
TliGCGVAQCL 
FPYKNLPIED 
FKRVRPPQBE 



AATDXNGDDY 
GSAIAPLGDL 
PPSFGYSMKG 
CSLPGTALKV 
HSKNMTISRG 
FTPANZSRQA 
AELIVSIPLQ 
VHQQSEMDTS 
EHKENPETBB 
NCTSDMEINP 
KIVCQVGRLD 
ITNSTIjVTTN 
QBREQLQPHB 



ADVFIGAPtiF 
DQDGFNDIAI 
ATDIDKN6YP 
SCFNVRPCLK 
GLMQCEELIA 
HILLDCGEDN 
ADFIGWRNN 
VKFDLQIQSS 
DVGPWQHIY 
LRIKISSLQT 
RGKSAILYVK 
VTWGIQPAPM 
NGEGHSBT 



MDRGSDGKLQ 
AAPYGGEDKK 
DLIVGAFGVD 
ADGKGVLPRK 
YXiRDESEFRD 
VCKPKLEVSV 
BALARLSCAF 
NLFDKVSPW 
ELRNNGPSSF 
TEKNDTVAGQ 
SLLWTETFMN 
PVPVWVIIIjA 



EVGQVSVSLQ 
GIVYIFNGRS 
RAILYRARPV 
IiNFQVELLIjD 
KLTPITIPME 
DSDQKKIYIG 
JCTENQTRQW 
SHKVDIiAVIiA 
SKAMIiHIiQHP 
GERDHI.ITKR 
KENQNHSYSL 
VLAGIfXiLliAV 



RASGDFQTTK 
TGIiNAVPSQI 
ITVNAGLEVY 
KIiKQKGAIRR 
YRLDYRTAAD 
DDNPLTLIVK 
CDLGNPMKAG 
AVEZRGVSSP 
YKYNNNTLLY 
DIiALSEGDIH 
KSSASPMVIE 
LVFVMYRMGP 
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Seq ID NO: 190 DNA sequence 
Nucleic Acid Accession #: NM_004864 

coding sequence: 26-952 (underlined sequences correspond to start and stop codons) 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGC6 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 
I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TOGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCCGGG 
GCTAC6AG6A 
ACCTCX3TCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 

I 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCXITATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAA6C 



41 

I 

CTCAGGACX^G 

CATGGGGGCG 

TTGCACTCCG 

AGGCTGCGGG 

GTCCGGATAC 

TCTCGGGCCG 

CX3GCTGTCCC 

AGCCTTGCAA 

TCX3GACCAAC 

CCGCAAGCC6 

GGGCCCGGGC 

GCCGATTGGG 

AGCCAGTTCC 

CCCX3ACACGG 

CAAAAGACCG 

CACTGCATAT_ 

GTTGTCCTGC 

GCTGTATTTA 

TCGGGGGCTG 

TGTCTGAACT 



51 
I 

TGT^TGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGG6GC6 
GTTGCTGCCG 
TGCTGTCGCC 
GGGCGGCAAA 
AGCCA6CGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



60 
120 
180 
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300 
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600 
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1020 
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Seq ID NO: 191 Protein aecruence: 
Protein Accession #: NP 004855 



MPGQELRTVN 
EDLLTRLRAN 
HRAIiFRIiSPT 
ELHLRPQAAR 
IGACPSQFRA 
LAKDCHCI 



11 

I 

GSQMIjLVLLV 
QSWEDSNTDL 
ASRSWDVTRP 
GRRRARARNG 
ANMHAQIKTS 



21 

I • 

LSWLPHGGAL 
VPAPAVRILT 
LRRQLSIiARP 
DDCPLGPGRC 
LHRLKPDTEP 



31 
I 

SLAEASRASF 
PEVRLGSGGH 
QAPALHLRLS 
CRIiHTVRASL 
APCCVPASYN 



41 

I 

PGPSEIJiSBD 
LHLRISRAAL 
PPPSQSDQLL 
EDLGWADWVIf 
PMVLIQKTDT 



51 
I 

SRFRELRKRY 60 

PBGLPEASRXj 120 

AESSSARPQL 180 

SPREVQVTMC 240 

GVSLQTYDDI* 300 



Seq ID NO: 192 DNA sequence 

Nucleic Acid Accession #: XM_061731.1 

Coding sequence: 1-567 (underlined sequences correspond to start and stop codons) 



ATGA GAAAAQ 
AGAAAGTTTC 
GTAAAAGAAC 
CTGTTGTTGA 
TCCCCTCCAA 
TGGGGTGGCA 
AGTCTTTCAC 
GAGGTCTTCC 
GCTGGATGGT 
ATGGTTTATT 



11 
I 

GAAATGAGGG 
TCAAAGAA6A 
CTTTCTCTCT 
TGTCCACAGA 
TGTGCACCAA 
AAGACACCAG 
TCACCAAACA 
AGCCACTTTC 
GGATTTATCA 
CTAAAGAAAC 



21 
I 

AGAGAACACA 
TGGCATTACA 
GATTGGACTT 
CACTGGCAA6 
ATCACGTAAA 
(3AGCAATACT 
TTCCCACAAG 
AGAGCCAGGT 
GAGCTGTCAG 
TGA GTGft 



31 



I 



GAAGAGGGCA 
TTGCACATCT 
6ACACACAGA 
(3ACAGGTTTA 
AATGGGGATA 
GATCTTCCTA 
CCTGTCCCTG 
GTAGAAGCA6 
GTTCCTTCCT 



41 

I 

GGCTTGCTCA 
CTCTGTGTCT 
AGGATCTCAG 
CCAACATACT 
AT6ACTCCCC 
TCAGA6ACCC 
AGCATCAGTG 
AGATGGAAGT 
CAACCCTTGC 



51 




1 

GCTTGCTCAA 


60 


CTCTATTGCT 


120 


TAAAGATTTG 


180 


GCTGTCACAC 


240 


TGCCTTCACA 


300 


TGGGGGCAAG 


360 


TGACCAGAGA 


420 


6TTCGCTGAT 


480 


AAGAAAGAAG 


540 



Seq ID NO: 193 Protein sequence; 
Protein Accession #: XP 061731.1 
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MRKGNEGENT EEGRIiAQLAQ RKFLKEDGIT LHISLCLSIA VKEPPSLIGL DTQKDLSKDL 60 

LLLMSTDTGK DRFTNILLSH SPPMCTKSRK NGDNDSPAFT WGGKDTRSNT DLPIRDPGGK 120 

SLSLTKHSHK PVPEHQCDQR EVPQPLSEPG VEAEMEVFAD AGWWIYQSCQ VPSSTIiARKK 180 
MVYSKETE 



Seq ID NO: 194 DNA seofuence 

Nucleic Acid Accession #: NM_005415.2 

Coding sequence: 371-2410 (underlined sequences correspond bo start and stop codons) 
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1 
I 

GAGCTGTCCC 
CGCCTCGATC 
TTCTCTTCTC 
CCAGACAACA 
CGTAGTTTAC 
CTCATATTCT 
ACTCCAGAGA 
TTTGGTGGAC 
CTCCGTGGGA 
GACCCTGAAG 
GGGGGCCAAA 
GACTCAAGGG 
ACTCGTGGCT 
TATTGGTTTC 
AATTGTGATG 
CTTCCTGGTT 
TTTGCCAGTT 
AGCACCGTTG 
ATGTGCAGTT 
AATTGAACGA 
CTTGAAAGAA 
TGTTTCTGAG 
CTCATTCAAA 
CTTGAAAGAG 
GAACCTTGTC 
GTATCACACC 
TGCCAAGGTG 
CTATACTTCC 
AGGTGAACAG 
GCGAATTCGA 
ATCTGAGATA 
TGGCTCTCTA 
CCTGCAGATC 
TGCCATTGGG 
AGTGGCAACA 
GGTTTGGGGA 
TAGTGGCTTC 
CCTTCCCATC 
GTCCAAGAAG 
AGTCCCCATT 
CAGAATGTGA 
TCCTGCTCCC 
TGGGAGCAGA 
GTCTCAAAAT 
TCCTTCTGGG 
CATTATGTTT 
CATGAAGAGC 
ACATGCACAG 
AAGTAGAGTC 
GGGAGCTTCT 
TTT6CAAGCA 
GCAATCTTGG 
GTGAACTTTG 
GAATAAAAAA 



11 

I 

CGGTGCCGCC 
TCCTCGTCTC 
CGCCATGGAA 
GATGCCCATA 
AGTATTTAAT 
GTTTACACAT 
ATGG CAACGC 
TACCTATGGA 
GCCAATGATG 
CAAGCCTGCA 
GTGAGCGAAA 
CTACTGATGG 
TCGTTTTTGA 
TCCCTCGTGG 
TCTTGGTTCG 
CGTGCATTCA 
TT CT ATGCCT 
CTGGGCTTTG 
TTCTGTGCCC 
GAAATAAAGT 
GACCATGAAG 
GTAGGGCCTG 
CTTGGAGATT 
GT^AACCAGCA 
CAGTTCAGTC 
GTGCATAAGG 
GGAGATTGCA 
TATACCATGG 
AAGGGCGAAG 
ATGGACAGTT 
GACAT6AGTG 
GAA6AATGGT 
CTTACAGCCT 
CCTCTGGTTG 
CCAATATGGC 
AGAA6AGTTA 
AGTATTGAAC 
AGTACAACAC 
GCTGTTGACT 
TCTGGAGTTA 
AGCTGTTTGA 
CTGAAGAATG 
GGAGGGAAGT 
TAGCTGTGTA 
CTGTGAATTC 
TAATGTTGTC 
CGTTTGACAG 
GGATTTAACA 
CTTGGTACTC 
TAGA6G6ATG 
GTTTATTGAC 
TTATTTCTTT 
GGCAAGTTAA 
GCCTACAGTT 



21 

I 

GACCCGGGCC 
CCGCTCCGCC 
TTCTGCTCCG 
CGCAGCGTAT 
TTTATATAAT 
CTTGAAAGGC 
TGATTACCAG 
TGCTCATCCT 
TAGCAAATTC 
TCCTAGCTAG 
CCATCCGGAA 
CC6GCTCAGT 
AGCTCCCTAT 
CAAAGGGGCA 
TGTCCCCACT 
TCCTCCATAA 
GCACAGTTGG 
ACAAACTTCC 
TTATCGTCTG 
GTAGTCCTTC 
AAACAAAGTT 
CCACTGTGCC 
TGGAGGAAGC 
TAGATAGCAC 
AAGCCGTCAG 
ATTCCGGCCT 
TGGGAGACTC 
CAATATGTGG 
AAATGGAGAA 
ACACCAGTTA 
TCAAGGCAGC 
ATGACCAGGA 
GCTTTGGGTC 
CTTTATATTT 
TTCTACTCTA 
TCCAGACCAT 
TGGCATCTGC 
ATTGTAAAGT 
GGCGTCTCTT 
TCAGTGCTGC 
GATTAAAATT 
ATTACAGTGT 
GTTACTTGTG 
AAATAGCCCG 
CTGTACATAT 
TCTGAAGATG 
AGCATGCTCT 
ACAAAAATAT 
TGCCCTCCTG 
AGGTTCTTTG 
TGTTATTGCT 
AAGATTTCTG 
ATGGGACAGC 
TTTA6AAAAA 



31 
I 

GTGCCGTGTG 

CTCCCTTTTC 

TGCTTTTAGC 

AGCAGTAACT 

ATATATTATT 

GCTCAGTAGT 

TACTACAGCT 

GGGCTTCATT 

TTTTGGTACA 

CATCTTTGAA 

GGGCTTGATT 

CAGTGCTATG 

TTCTGGAACC 

G6AGGGTGTC 

GCTTTCTGGA 

GGCAGATCCA 

AATAAACCTC 

TCTGTGGGGT 

GTTCTTTGTA 

TGAAAGCCCC 

GTCTGTTGGT 

CCTCCAGGCT 

TCCAGAGAGA 

CGTGAATGGT 

CAACCAAATA 

GTACAAAGAG 

CGGTGACAAA 

CATGCCTCTG 

6CTGACATG6 

CTGCAATGCT 

GATGGGTCTA 

TAAGCCTGAA 

ATTCGCCCAT 

GGTTTATGAC 

TGGTGGTGTT 

G6G6AA6GAT 

CCTCACTGTG 

GGGCTCTGTT 

TCGTAACATT 

CATCATGGCA 

TGTGTCAATG 

TAACAGAAGA 

CTATAACTGC 

GGTTCCACTG 

TTCTCTACTT 

ACTTGTGATT 

GCGTTGTTGG 

AACTACAACT 

TCAGTAGTGG 

AACACA6TGA 

AAGAAGAAGT 

6CAGTGTGGG 

CTTCCATGTT 

ACCCX3AATTC 



41 

I 

CCCGTGGCTC 
CCTGGATGAA 
CCTCCTGAGC 
CCCCAGCTCG 
TATTATAGCA 
TCTCTTACTA 
GCTACCGCCG 
ATTGCATTTG 
GCTGTGGGCT 
ACAGTGGGCT 
GACGTGGAGA 
TTTGGTTCTG 
CATTGTATTG 
AAGTGGTCTG 
ATTATGTCTG 
GTT CCTAATG 
TTTTCCATCA 
ACCATCCTCA 
TGTCCCAGGA 
TTAATGGAAA 
GATATTGAAA 
GTGGTGGAGG 
GAGAGGCTTC 
GCAGTGCAGT 
AACTCCAGTG 
CTACTCCATA 
CCCTTAAGGC 
GATTCATTCC 
CCTAATGCAG 
GTGTCTGACC 
GGTGACAGAA 
GTCTCTCTCC 
GGTGGCAAT6 
ACAG6AGAT6 
GGTATCTGTG 
CTGACACCGA 
GTGATTGCAT 
GTGTCTGTTG 
TTTATGGCCT 
ATCTTCAGAT 
TTTGGGACCA 
CTGACAAGAG 
TTTTGTGCTA 
GCTCCTGCTG 
TTTGTATCAG 
TTTTTTTCTT 
TTTCACCAGC 
TCCXrTTGTAG 
CAGGATCTAT 
AAATTTAAAT 
AAGAAAGAAA 
ATGGATGAAT 
CATTTGTCTA 



SI 

I 

CAGCCGCTGC 
CTTGCGTCCT 
CAAAGAAACC 
GTTTCTGTGC 
TTTTTGATAC 
AACAACCACT 
CTTCTGGTCC 
TCTTGGCATT 
CAGGTGTAGT 
CTGTCTTACT 
TGTACAACTC 
CTGTGTGGCA 
TTGGTGCAAC 
AACTGATAAA 
GAATTTTATT 
6TTTGCGAGC 
TGTATACTGG 
TCTCGGTGGG 
TGAAGAGAAA 
AAAAGAATAG 
ACAAGCATCC 
AGAGAACAGT 
CCAGCGTGGA 
TGCCTAATGG 
GCCACTCCCA 
AATTACATCT 
GCAATAATAG 
GTGCCAAAGA 
ACTCCAAGAA 
TTCACTCAGC 
AAGGAAGTAA 
TCTTCCAGTT 
ACGTAA6CAA 
TTTCTTCAAA 
TTGGTCTGTG 
TCACACCCTC 
CAAATATTGG 
GCTGGCTCCG 
GGTTTGTCAC 
ATGTCATCCT 
TCTTAGGTAT 
TCTTTTTATT 
AATATGAATT 
AGGTCCCCTT 
GCTTCAATTC 
TTTTTTAAAC 
TTCTGCCCTC 
TCTCTTATAT 
TGGCATATTC 
TAGTAACTTT 
AAGCCTGTTG 
GAAGTGGAAT 
CCTCTTAACT 



60 
120 
180 
240 
300 
360 
420* 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



70 



75 



Seq ID NO: 195 Protein sequence; 
Protein Accession #: NP 005406.2 



MATLITSTTA 
QAClIiASlFE 
SFLKIiPISGT 
rAFIIjHKADP 
FCAliIVWFFV 



11 

I 

ATAASGPLVD 
TVGSVLLGAK 
HCIVGATIGF 
VPNGLRAIiPV 
CPRMKRKIER 



21 

1 

yiiWMLILGFI 
VSETIRKGLI 
SLVAKGQEGV 
FYACTVGINL 
EIKCSPSESP 



31 
I 

lAFVIiAFSVG 
DVEMYNSTQG 
KWSELIKIVM 
FSIMYTGAPI* 
IjMEKKNSXiKE 



41 



51 



ANDVANSFGT AVGSGWTLK 60 

LLMAGSVSAM FGSAVWQLVA 120 

SWFVSPLLSG IMSGILFPLV 180 

LGFDKLPLWG TILISVGCAV 240 

DHEETKIiSVG DIBNKKPVSE 300 



271 
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10 



15 



20 
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35 
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45 



50 



VGPATVPLQA 
QFSQAVSNQI 
VTMAICGMPL 
DMSVKAAMGL 
PIjVALYLVYD 
SIELASAIiTV 
SGVISAAIMA 



WBERTVSFK 
NSSGHSQYHT 
DSPRAKBGEQ 
GDRKGSNGSL 
TGDVSSKVAT 
VIASNZGIjPI 
IPRYVILRM 



LGDIiEEAPER 
VHKDS6LYKE 
KGBEMEKLTW 
EEWYDQDKPE 
PIWIiLIiYGGV 
STTHCiCVGSV 



ERLPSVDLKE 
LLHKIiHIiAKV 
PNADSKKRIR 
VSLIiFQFLQI 
GICVGLWVW6 
VSVGWLRSKK 



ETSIDSTVNG 
GDCMGDSGDK 
MDSYTSYCNA 
LTACFGSFAH 
RRVIQTMGKD 
AVDWRLFRNI 



AVQLPNGNLV 360 

PLRRNNSYTS 420 

VSDLHSASEI 4 80 

GGNDVSNAIG 540 

LTPITPSSGF 600 

PMAWPVTVPI 660 



Seq ID NO: 196 DNA geouence 

Nucleic Acid Accession #: NM_000020.1 

Coding sequence: 283-1794 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



AGGAAACGGT 
AGAAACATTT 
GAGCGAGCCC 
CCAGCGCTGG 
AGGCTAGCGC 
AGGAAAGGCC 
TCTCGGGGCC 
CGGGGGGCCT 
CGGGGCTGCG 
CACTACTGCT 
CAACCTCCTT 
CTGGCCTTGC 
CAGGAGAA6C 
TCTGAGCAGG 
GGCTCAGGGC 
TGTGTGGGAA 
GCCGTCAAGA 
AACACAGTAT 
CGCAACTCGA 
GACTTTCTGC 
GCATGCGGCC 
GCCCACCGCG 
GCCGACCTGG 
AACCCGAGAG 
ACGGACTGCT 
GAGATTGCCC 
GATGTGGTGC 
CAGACCCCCA 
ATGATGCGGG 
AAGACACTAC 
AGCACCTGAT 
CTATCTGGGT 
TGCTCGGCCC 



TTATTAGGAG 
TTGCTCCAGC 
CTCCCCGGCT 
CGGTGCAACT 
CCCGCCACCC 
TTCTGATGCT 
CGCTGGTGAC 
GGTGCACAGT 
GGAACTTGCA 
GCGACAGCCA 
CGGAGCAGCC 
TGGCCCTGGT 
AGCGTGGCCT 
GCGACAC6AT 
TCCCCTTCCT 
AAGGCCGCTA 
TCTTCTCCTC 
TGCTCAGACA 
GCACGCAGCT 
AGAGACAGAC 
TGGCGCACCT 
ACTTCAAGAG 
GCCTGGCTGT 
TGGGCACCAA 
TTGAGTCCTA 
GCCGGACCAT 
CCAATGACCC 
CCATCCCTAA 
AGTGCTGGTA 
AAAAAATTAG 
TCCTTTCTGC 
AGAGGTAGTG 
CCAGCCCACC 



GGAGTGGTGG 
CCCCATCCCA 
CCAGCCCGGT 
GCGGCCGCGC 
GCAGAGCGGG 
GCTGATGGCC 
CTGCACGTGT 
AGTGCTGGTG 
CAGGGAGCTC 
CCTCTGCAAC 
GGGAACAGAT 
GGCCCTGGGT 
GCACAGCGAG 
GTTGGGGGAC 
GGTGCAGAGG 
TCGCGAAGTG 
GAGGGATGAA 
CGACAACATC 
GTGGCTCATC 
GCTGGAGCCC 
GCACGTGGAG 
CCGCAATGTG 
GATGCACTCA 
GCGGTACATG 
CAAGTGGACT 
CGTGAATGGC 
CAGCTTTGAG 

ccggctggct 
cccaaacccc 
caacagtcx:a 
ctgcaggggg 
tgagtgtggt 
cagccaaaaa 



AGCTGGGCCA 
GTCCCGGGAG 
CCGGGGCCGC 
GGTGGAGGGG 
CCCAGAGGGA 
TTGGTGACCC 
GAGAGCCCAC 
CGGGAGGAGG 
TGCAGGGGGC 
CACAACGTGT 
GGCCAGCTGG 
GTCCTGGGCC 
CTGG6AGAGT 
CTCCTGGACA 
ACAGTGGCAC 
TGGCGGGGCT 
CAGTCCTGGT 
CTAGGCTTCA 
ACGCACTACC 
CATCTGGCTC 
ATCTTCGGTA 
CTGGTCAAGA 
CAGGGCAGCG 
GCACCCGAGG 
GACATCTGGG 
ATCGTGGAGG 
GACATGAAGA 
GCAGACCCGG 
TCTGCCCGAC 
GAGAAGCCTA 
CTGGGGGGGT 
GTGTGCTGGG 
TACAGCTGGG 



GGCAGGAAGA 
GCTGCCGCGC 
GCCGGACCCC 
AGGTGGCCCC 
CCATGACCTT 
AGGGAGACCC 
ATTGCAAGGG 
GGAGGCACCC 
GCCCCACCGA 
CCCTGGTGCT 
CCCTGATCCT 
TGTGGCATGT 
CCAGTCTCAT 
GTGACTGCAC 
GGCAGGTTGC 
TGTGGCAC66 
TCCG6GA6AC 
TCGCCTCAGA 
ACGAGCACGG 
TGAGGCTAGC 
CACAGGGCAA 
GCAACCTGCA 
ATTACCTGGA 
TGCTGGACGA 
CCTTTGGCCT 
ACTATAGACC 
AGGTGGTGTG 
TCCTCTCAGG 
TCACCGCGCT 
AAGTGATTCA 
GGGGGGCAGT 
6ATGGGCAGC 
CTGAAACCTG 



51 

I 

CGCTGGAATA 
CAGCTGCGCC 
AGCCCGCCGT 
GGTCCGCCGA 
GGGCTCCCCC 
TGTGAAGCCG 
GCCTACCTGC 
CCAGGAACAT 
GTTCGTCAAC 
GGAGGCCACC 
GGGCCCCGTG 
CCGACGGAGG 
CCTGAAAGCA 
CACAGGGAGT 
CTTG6TGGAG 
TGA6AGTGTG 
TGA6ATCTAT 
CATGACCTCC 
CTCCCTCTAC 
TGTGTCCGCG 
ACCAGCCATT 
GTGTTGCATC 
CATCGGCAAC 
GCAGATCCGC 
GGTGCTGTGG 
ACCCTTCTAT 
TGTGGATCAG 
CCTAGCTCAG 
GCGGATCAAG 
ATAGCCCAGG 
GGATGGTGCC 
TGC6CCTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



Seq ID NO: 197 Protein a^rp,^^r^^ . 
Protein Accession #: NP 000011.1 



55. 



60 



65 



70 



75 



MTLGSPRKGL 
RHPQEHRGCG 
LIIiGPVLAIiL 
OCTTGSGSGL 
RETEIYNTVIi 
RLAVSAACGL 
YLDIGNNPRV 
YRPPFYDWP 
TALRIKKTIiQ 



11 
I 

LMLLMAIiVTQ 
NliHRELCRGR 
ALVAIjGVLGL 
PFLVQRTVAR 
LRHDNILGFI 
AHLHVEIFGT 
GTKRYMAPEV 
NDPSPEDMKK 
KZSMSPEKPK 



21 

I 

GDPVKPSRGP 
PTEFVNHYCC 
WHVRRRQEKQ 
QVAIjVECVGK 
ASDMTSRNSS 
QGKPAIAHRD 
LDEQIRTDCF 
WCVDQQTPT 
VIQ 



31 
I 

LVTCTCESPH 
DSHLCNHNVS 
RGXiHSELGES 
GRYGBVWRGIi 
TQLWLITHYH 
FKSRNVLVKS 
ESYKWTDIWA 
IPNRLAADPV 



41 

I 

CKGPTCRGAW 
LVIiEATQPPS 
SLILKASEQG 
WHGESVAVKI 
EHGSLYDFLQ 
NLQCCIADLG 
FGLVLWEIAR 
LSGLAQMMRE 



51 
I 

CTWIiVREEG 
EQPGTDGQLA 
DTMLGDIiLDS 
FSSRDEQSWF 
RQTLEPHIiAL 
liAVMHSQGSD 
RTIVNGIVED 
CWYPNPSARIi 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 198 DNA aemienip^ 

Nucleic Acid Accession #: NM_003 199.1 

Coding sequence: 200-2203 (underlined sequences correspond to start and stop codons) 



1 

I 

CGGGGGGATC 
GGAGGCAGCA 
GACATGAACG 
TTTGTGTGAT 
AGAGCTGAGT 
AAATGGACCA 
TAGCTCAGGG 



11 
I 

TTGGCTGTGT 

GGCGCGGGAG 

CCGCCTCGGC 

TTTGCTAAAA. 

GATTTACTGG 

ACTTCTTTGG 

TCCTGGGGGA 



21 
I 

GTCTGCGGAT 
CGGGCGCAGG 
GCCGGCGGTG 
TGCATCACCA 
ATTTCAGTGC 
CAAGTGGACA 
ATGGAG6ACA 



31 

I 

CTGTAGTGGC 
AGCAGGCGGC 
CACGGAGAGC 
ACAGCGAATG 
GATGTTTTCA 
TTTTACTGGC 
TCCAA6CCCG 



41 

I 

GGCGGCGGCG 
GGCGGTGGCG 
CCCTTCTCGC 
GCTGCCTTAG 
CCTCCTGTGA 
TCAAATGTAG 
TCCAGGAACT 



51 
I 

GCG6C6GCGG 60 

6CGGCGGTTA 120 

GCGCGGGCGG 180 

GGACGGACAA 240 

GCAGTGGGAA 300 

AAGACAGAAG 360 

ATGGAGATGG 420 



272 
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GACTCCCTAT 

TTTTGTCAAT 

AGAATCAAAC 

CAACCCAGGA 

TAATCCCCGA 

AAAAGTTCCT 

CAATAGGGAC 

CTTCTTCATG 

TCAGCCTGGC 

CTGTAGCCTG 

TTCCAGTCTT 

TTCCTGTACG 

AGCCGGCAGC 

AGATCACACT 

TCTCTCAGCA 

TTATGAAGGA 

TGATGCTATT 

TCATGGGGAC 

CTCAGGGTAT 

TCGTGAAGAT 

TCCACAGCTT 

CAGAGGCATG 

ATCCGATGAC 

AGATGACGAC 

ACCAGAGCAG 

TCTGCGGGTC 

CCTCAAGAGT 

CCTCAGTCTG 

AA6AAGGGA6 

CCCTGGAATG 

CATTGCTTCA 

AAACACTTCT 

GCAGAC6CAA 

AGAAAAAAGT 

AGTATGTACA 



GACCACATGA 
TCCAGAATAC 
TTACAGGGTT 
ACCCTTTCGC 
AGGAGGCCTC 
CCAGGTTTGC 
TCGCCAGGCT 
CAAGATGGCC 
TATGCAGGAA 
CATCCACATG 
CCTCCGATGT 
CCTCCTGCCA 
TCCCAGACTG 
AACAACAGCT 
GGCACAGCT6 
CCCTTACACT 
CATGTTCTCC 
ATGCATGGAA 
GGAACCGGCC 
GGCGTGGCCC 
CCTGTCCAGT 
CCACCAGGAC 
GAGGGTGATG 
AAGAAGGATA 
AAGGCAGAGC 
CGTGACATCA 
GACAAGCCCC 
GAGCAGCAA6 
GAAGA6AAGG 
GGAGACGCAT 
TTAAAACAA6 
CCTTAACCCC 
GAGGTTT C AG 
GCAACTTGAG 
GGCTGAGACA 



CCAGCAGGGA 
AAAGTAAAAC 
GCCACCAGCA 
CCACCAAACC 
TTCACAGTAG 
CATCTTCAGT 
ATCCTTCCTC 
ATCACAGCAG 
TGTTGG6CAA 
AACGTTT6AG 
CCACTTTCCA 
ACGGGACAGA 
GAGATGCTCT 
TTTCATCAAA 
TTTGGTCTAG 
CTTTGCAAAG 
GGAACCATGC 
TCATTGGACC 
TTCTTTCAGC 
TGAGAGGCAG 
CTGCX3ACTTC 
TACAGGGGCA 
AGAACCTGCA 
TCAT^TCAAT 
GTGAGAAGGA 
ACGAGGCTTT 
AGACCAAGCT 
TCCGA6AAAG 
TGTCCTCGGA 
CGAATCACAT 
AGACCACTTC 
CATTTTTGTA 
CATTCCCAAT 
GGACGACTTT 
CAGCCCAGAG 



CCTTGGGTCA 
AGAAAGGGGC 
GAGTCTCCTT 
TGGTTCCCAG 
TGCCATGGAG 
CTATGCTCCA 
CAAACCAGCA 
TGACCCTTGG 
CTCTTCTCAT 
CTATCCATCA 
TCGTA6TG6T 
CAGTATAATG 
GGGGAAAGCA 
CCCTTCAACT 
AAATGGA6GA 
CCGAATTGAA 
AGTGGGCCCA 
TTCTCATAAT 
CAACAGACAT 
CCATTCTCTT 
CCCTGACCTG 
GAGTGTCTCC 
AGACACX3AAA 
TACTAGCAAT 
GCGGAGGATG 
CAAAGAGCTC 
CCTGATCCTC 
GAATCT6AAT 
GCCTCCCCCT 
GGGACAGATG 
CTTAACAGCT 
ATATAAGACA 
TATCAAAAAA 
CTTTAACATA 
ACTGAACGGC 



CATGACAATC 
TCATACTCAT 
GGAGGTGACA 
TACTATCAGT 
GTACAGACAA 
TCAGCAAGCA 
ACCAGCACTT 
AGCTCCTCCA 
ATTCCACAGT 
CACTCCTCAG 
ACAAACCATT 
GCAAATAGAG 
CTTGCTTCGA 
CCTGTTGGCT 
CA6GCCTCAT 
GATCGTTTAG 
TCCACAGCTA 
GGAGCCATGG 
TCACTCATGG 
CTGCCAAACC 
AACCCACCCC 
TCTGGCAGCT 
TCTTCGGAGG 
AATGACGATG 
GCCAACAATG 
GGCCGCATGG 
CACCAGGCGG 
CCGAAAGCTG 
CTCTCCTTGG 
TAAA AGGGTC 
GTATTATCTT 
AGTCTGAGTA 
CAGAAAAACA 
TCATTCAGAA 



TCTCTCCACC 
CTTATGGGAG 
TGGATATGGG 
ATTCTAGCAA 
AGAAAGTTCG 
CTGCCGACTA 
TCCCTAGCTC 
GTGG6ATGAA 
CCAGCAGCTA 
CAGACATCAA 
ACAGCACCTC 
GAAGCGGGGC 
TCTATTCTCC 
CTCCTCCATC 
CGTCTCCTAA 
AAAGACTGGA 
TGCCTGGTGG 
GTGGTCTGGG 
TGGGGACCCA 
AGGTTCCGGT 
AGGACCCTTA 
CTGAGATCAA 
ACAAGAAATT 
AGGACCTGAC 
CCCGAGAGCG 
TGCAGCTCCA 
TGGCCX3TCAT 
CGTGTCTGAA 
CCGGCCCACA 
CAAGTFGCCA 
AAACCCACAT 
GTTATGAATC 
AAAAAAAGAA 
TGTGCAAAGC 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 199 Protein sequence; 
Protein Accession NP 003190.1 



MHHQQRMAAIi 
NGGHPSPSRN 
CHQQSIiLGGD 
PSSVYAPSAS 
MLGNSSHIPQ 
NGTDSZMANR 
VWSRNGGQAS 
IIGPSHNGAM 
SATSPDLNPP 
IKSZTSNNDD 
QTKIiIilIiHQA 
SNHMGQM 



11 

I 

GTDKELSDLL 
YGDGTPYDHM 
MDMGNPGTLS 
TADYNRDSPG 
SSSYCSIjHPH 
GSGAAGSSQT 
SSPNYBGPIiH 
GGLGSGYGTG 
QDPYRGMPPG 
EDLTPEQKAE 
VAVZIiSIiEQQ 



21 
I 

DFSAMFSPPV 
TSRDLGSHDN 
PTKPGSQYYQ 
YPSSKPATST 
ERIiSYPSHSS 
GDALGKAIiAS 
SI.QSRIEDRL 
LLSANRHSLM 
Z.QGQSVSSGS 
RBKERRMANN 
VRBRNLHPKA 



31 
I 

SSGKNGPTSL 
LSPPPVNSRI 
YSSNNPRRRP 
FPSSFFMQDG 
ADINSSIiPPM 
lYSPDHTHNS 
BRLDDAIHVL 
VGTHREDGVA 
SEIKSDDEGD 
ARKRLRVRDZ 
ACLKRRBBEK 



41 
I 

ASGHFTGSNV 
QSKTERGSYS 
XiHSSAMEVQT 
HHSSDPWSSS 
STFHRSGTNH 
FSSNPSTPVG 
RNHAVGPSTA 
ItRGSHSIjIiPN 
ENLQDTKSSE 
NBAFKEIiGRM 
VSSEPPPLSL 



51 

I 

EDRSSSGSWG 
SYGRESNIiQG 
KKVRKVPPGL 
SGMNQPGYAG 
YSTSSCTPPA 
SPPSIiSAGTA 
MPGGHGDMHG 
QVPVPQLPVQ 
DKKIiDDDKKD 
VQZjHUCSDKP 
AGPHPGMGDA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession #: BC005987 (1-1286), BE888744 (1287-1756) 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



GGCAGAAGAG GAAGATTTCT GAAGAGTGCA GCTGCCTGAA CCGAGCCCTG CCGAACAGCTT 60 

GAGAATTGCA CTGCAAC CAT GA GTGAGAAC AATAAGAATT CCTTCSGAGAG CrAGCCTACGG 120 

CAACTAAAAT GCCATTTCAC CTGGAACTTG ATGGA(3<3GAG AAAACTCCTT GGATGATTTT 180 

GAAGACAAAG TATTTTACCG <3ACTGAGTTT CAGAATCIGTG AATTCAAAGC (3^CAAT6TGC 240 

AACCTACTGG CCTATCTAAA GCa^CCTCAAA GGGCAAAACG AGGCAGCCCT G(3AATGCTTA 300 

CGTAAAGCTG AAGAGTTAAT CCAGCaAGAG CATGCTGACC AGGCA(3AAAT CAGAA6TCTG 360 

GTCACCTGGG GAAACTATGC CTGGGTCTAC TATCACATOG GCCX3ACTCTC AGA<3GTTC3\G 420 

ATTTATGTAG ACAAGGT6AA ACATGTCTGT GAGAAGTTTT CCAGTCCCrTA TAQAATTGAG 480 

AGTCCAGAGC TTGACTGTGA GGAAGGGTGG ACACGGTTAA AGTGTGGARG AAACCAAAAT 540 

GAAAGAGCGA AGGTGTGCTT TGAGAAG<3<n' CTGGAAAAGA AGCCAAAGAA CCCAGAATTC 600 

ACCTCTGGAC TGGCAATAGC AAGCTACCGT CTGGACAACT GGCCACCATC TCAGAACGCC 660 

ATTGACCCTC TGAGGCAAGC CATTC(3GCT6 AATCCTGACA ACCAGTACCT TAAAGTCCTC 720 

CTGGCTCTGA AGCTTCAT7VA GATGCGTGAA GAAGGTGAAG AGGAAGGTGA AGGAGAGAAG 780 

TTAGTT(3AAG AAGCCTTGGA GAAAGCCCCA GGTGTAACAG ATGTACTTCG CAGTGCAGCC 840 

AACOTTTATC GAAGAAAAGA TGAGCCAGAC AAAGC6ATTG AACTGCTTAA AAAGGCTTTA 900 

GAATACATAC CAAACAATGC CTACCTGCAT TGCCAAATTG GGTGCTGCTA TA6GGCAAAA 960 



273 



wo 02/079492 



PCT/US02/04915 



10 



15 



GTCTTCCAAG 
CTAATAGGAC 
CX5TGTCTGTT 
TATTACTTCC 
CTGCGGTATG 
TTTATAGAGG 
CTGCAAAAAA 
GTCTTGGCAT 
AGGGGTTTGG 
ATAGAGATGT 
GGTATTCAAA 
CACTGGACAG 
GGGAGAGGGA 
CTCTTTGCGG 



TAATGAATCT 
ACX3CTGTGGC 
CCATTCTTGC 
AAAAGGAATT 
GCAACTTTCA 
GTGTAAAAAT 
TTGCCAAAAT 
TCCTTCAGGA 
AGTCTGGAAG 
GGTGCCCACT 
ATATGTAATG 
GGTTATGTTA 
CAGATTGGGG 
AACTTC 



AAGAGAGAAT 
TCATCTGAAG 
CAGCCTCCAT 
CAGTAAAGAG 
6CTGTACCAA 
AAACCAGAAA 
GCGACTTTCT 
GCTGAATGAA 
CCTCATCCCT 
AGGCTACTGC 
ACTGGTATGG 
AACCTGAATT 
GGTCGTCCAG 



GGAATGTATG 
AAAGCTGATG 
GCTCTAGCAG 
CTTACTCCTG 
AT6AAGTGTG 
TCAAGGGAGA 
AAAAATGGAG 
AAAATGCAAC 
TCAGCATCAA 
TGAAAGGGAG 
CAAAAGATTG 
GCTGGGTCTT 
GGCTGCGCTA 



GGAAAAGAAA 
AGGCCAATGA 
ATCAGTATGA 
TAGCGAAACA 
AAGACAAGGC 
AAGAAAAGAT 
CAGATTCTGA 
AAGCAGATGA 
GCTGGAATGG 
CTGTUUITTCC 
GACTAAGACA 
AAAAGAGCCC 
AATTATTCTC 



GTTACTGGAA 
TAATCTCTTC 
AGAAGCAGAG 
ACTGCTCCAT 
CATCCACCAC 
GAAAGACAAA 
GGCTTTGCAT 
AGACTCTGAG 
GGA ATGAA GA 
TCCACAAGTT 
CTGGCCATAC 
AAGGAGTTCT 
AATGATTTGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 201 Protein segueq^f^ . 
Protein Accession #: AAA59191 



20 



25 



30 



35 



40 



45 



50 



55 



60. 



65 



70 



75 



MSENNKNSLE 
KHLKGQNEAA 
KHVCEKFSSP 
ASYRLDNWPP 
EKAPGVTDVIj 
LRENGMYGKR 
FSKELTPVAK 
MRIiSKNGADS 



11 
I 

SSLRQIiKCHF 
IiECLRKAEBI* 
YRIESPEIiDC 
SQNAIDPLRQ 
RSAAKFYRRK 
KliliELIGHAV 
QIjLHLRYGNF 
EALHVLAFLQ 



21 
I 

TWNLMEGENS 
IQQEHADQAE 
EEGWTRIjKCG 
AIRLNPDNQY 
DEPDKAIELIi 
AHLKKADEAN 
QLYQMKCEDK 
ELNBKMQQAD 



31 
I 

LDDPEDKVFY 
IRSLVTWGNY 
GNQNERAKVC 
LKVLIxALKLH 
KKALEYIPNN 
DNIiFRVCSiL 
AIHHFIEGVK 
BOSERGXiESG 



41 

I 

RTEFQNREFK 
AWVYYHMGRIj 
FEKAIiEKKPK 
KMREEGEEEG 
AYLHCQIGCC 
ASLHALADQY 
INQKSREKEK 
SLZPSASSWN 



51 

I 

ATMCNLLAYL 
SDVQIYVDKV 
NPEFTSGliAI 
EGEKLVEEAIi 
YRAKVFQVMN 
EDAEYYFQKE 
MKDKLQKIAK 
GB 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 202 PNA sequence 
Nucleic Acid Accession #: NM_003090 

coding sequence: 57-824 (underlined sequences correspond to start and stop codons) 



GAATTCCGCG 
TCAAGCTGAC 
GGGAGCTGGA 
ACCAGTTTGA 
TGTTGAGAAG 
GACTTGATCA 
AACTGGGTGA 
GAAATCCGGT 
TCAGAGTACT 
TCAAGGGCAA 
ATCCAGGTGC 
AAGCAATCAA 
GGTTGCTGCA 
GTGAAGAAGA 
ATAATAATAG 
TGTGTTAGCA 
TTTGTAATAT 
CACTCTCTAT 



11 
I 

GGAGGCCACG 
GGCGGAGCTG 
CCTCCGGGGG 
TGCTATTGAT 
ACTGAAAACA 
GGCTCTGCCC 
TCTGGACCCT 
AACCAATAAG 
GGATTTCCAG 
ACGGGGTGCA 
TGGTTTGCCA 
GAATGCCATA 
GTCTGGTCAG 
GATGGAAGAA 
GCCCTCTTGG 
AAGTGGAATC 
AAGTTTTGAA 
GCTAAAAAAA 



21 
I 

GGCTTTCCAC 
ATCGAGCAGG 
TATAAAATTC 
TTTTCTGACA 
TTGTTAGTGA 
TGTCT6ACAG 
CTGGCATCTC 
AAGCATTACA 
AAA6TGAAAC 
CAGCTTGCAA 
ACTGACAAAA 
GCAAATGCTT 
ATCCCTGGCA 
GACACAGTCA 
AACAAGTCTT 
TATCAGCATT 
ATCTAAATGT 
AAAAAAAGGA 



31 
I 

AGCGCGGGGG 
CGGCGCAGTA 
CCGTCATTGA 
ATGAGATCAG 
ACAACAACAG 
AACTCATTCT 
TCAAATCGCT 
GATT6TATGT 
TAAAAGAGCX3 
AGGATATT6C 
AGAGAG6TGG 
CAACTCTGGC 
GAGAACGCAG 
CAAACGGGTC 
GCTTTTCGAA 
GTTGAAATGC 
CAATTTTCTA 
ATTC 



41 
I 

AACGGGAGGC 
CACCAACGCG 
AAATCTAGGT 
GT^AACTGGAT 
AATATGCCGT 
CACCAATAAT 
GACTTACCTA 
GATTTATAAA 
TCAGGAAGCA 
CAGGAGAA6C 
GCCATCTCCA 
TGAAGTG6A6 
ATCAG6GCCC 
CTGAG CAGTS 
CATGGTATAA 
TTAA6ACTGC 
CAAATTATAA 



51 
I 

TGCAGGATGG 
GTGCGCGACC 
GCTACGTTAG 
GGTTTTCCTT 
ATAGGTGAGG 
AGTCTCGTGG 
AGTATCCTAA 
GTTCCGCAAG 
GAGAAAATGT 
AAAACTTTTA 
GG66AT6TAG 
AG6CTGAAGG 
ACTOATGATG 
AGGCA6ATGT 
TAGCCTTGTT 
TGCTGATAAT 
AAATAAACTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 203 Protein sequence.' 
Protein Accession ft: NP 003081.1 



1 
I 

MVKLTAELIE 
PLLRRLKTLL 

LRNPVTNKKH 
FNPGAGIjPTD 
DGEEEMEEDT 



11 



21 



31 



41 

I 



51 



QAAQYTNAVR DRELDLRGYK IPVIENLGAT LDQFDAIDPS DNEIRKLDGF 
VNNNRICRIG EGLDQALPCL TELILTNNSL VELGDLDPLA SLKSLTYLSI 
YRLYVIYKVP QVRVLDFQKV KLKERQEAEK MFKGKRGAQL AKDIARRSKT 
KKRGGPSPGD VEAIKNAIAN ASTLAEVERL KGLLQSGQIP GRERRSGPTD 
VTNGS 



60 
120 
180 
240 



Seq ID NO: 2 04 PNA sequence 

Nucleic Acid Accession #: NM_01764 3.1 

Coding sequence: 169-1401 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

> > I I I I 

AATAGCAATA GCTTTATAGC AGCTCCGGTT ACCTGTTTTA AACATGGAAG GAGAGTCGCT 60 

CCCAGATAGC CCTCACGAGT GGCCCTGGAG CAGGGAGTGG TGGAGCAGAT CTTCCTTGTT 120 

TGGGAGGAGC CTGAGGTGGA CCTCGCGTCC TGAGTCTGGA AGGCACCTAT6GGGACCTGC 180 

TGGGGTGATA TCTCAGAAAA T6TGAGAGTA GAAGTTCCCA ATACAGACTG CAGCCTACCT 240 



274 
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ACCAAAGTCT TCTGGATTGC TGG7UVTTGTA AAATTAGCAG GTTACAATGC CCTTTTT^GA 3 00 

TATGAAGGAT TTGAAAATGA CTCTGGTCTG GACTTCTGGT GCAATATATG TGGTTCTGAT 360 

ATCCATCCAG TTGGTTGGTG TGCAGCCAGC GGAAAACCTC TTGTTCCTCC TAGAACTATT 420 

CAGCATAAAT ATACAAACTG GAAAGCTTTT CTAGTGAAAC GACTTACTGG TGCCAAAACA 4 80 

5 CTGCCTCCTG ATTTCTCCCA AAAGGTTTCA GAGAGTATGC AGTATCCTTT CAAACCTTGC 540 

ATGAGAGTAG AAGTGGTTGA CAAGAGGCAT TTGTGTCGAA CACGAGTAGC AGTGGTGGAA 600 

AGTGTAATTG GAGGAAGATT AAGACTAGTG TATGAAGAAA GCGAAGATAG AACAGATGAC 660 

TTCTGGTGCC ATATGCACAG CCCATTAATA CATCATATTG GTTGGTCTCG AAGCATAGGT 720 

CATCGATTCA AAAGATCTGA TATTACAAAG AAACAGGATG GACATTTTGA TACACCACCA 780 

10 CATTTATTTG CTAAGGTAAA AGAAGTAGAC CAGAGTGGGG AATGGTTCAA GGAAGGAATG 840 

AAATTGGAAG CTATAGACCC ATTAAATCTT TCTACAATAT GTGTCGCAAC CATTAGAAAG 900 

GTGCTAGCTG ACGGATTCCT GATGATTGGG ATCGATGGCT CAGAAGCAGC AGACGGATCT 960 

GACTGGTTCT GTTACCATGC AACCTCTCCT TCTATTTTCC CTGTCGGTTT CTGTGAAATT 1020 

AACATGATTG AACTTACTCC ACCCA6AGGT TACACAAAAC TTCCTTTTAA ATGGTTTGAC 1080 

15 TACCTCAGGG AAACTGGCTC CATTGCAGCA CCAGTAAAAC TATTTAATAA GGATGTTCCA 1140 

AATCACGGAT TTCGTGTAGG AATGAMTTA GAAGCAGTAG ATCTCATGGA GCCACGTTTA 1200 

ATATGTGTAG CCACAGTAAC TCGAATTATT CATCX3TCTCT TGAGGATACA TTTTGATGGA 1260 

TGGGAAGAAG AGTATGATCA GTGGGTAGAC TGTGAGTCAC CTGACCTCTA TCCTGTAGGG 1320 

TGGTGTCAGT TAACTGGATA TCAACTACAG CCTCCAGCAT CACAGTGTAA GTTGGTATAC 138 0 

20 AGAAAAGGTG TCCTTTTG TA AA AATCAGCA ATTCTCCAGA GGACTATCTC ACATAAGTCA 144 0 

TCTTATGAGC TCACAGGACA AGAATATACC TATGTCTGAT TGGTTGCCAG GTAAGACATT 150 0 

AAGACTCAAC AACAATATCA CAGAATCAGA CCATGTGTCC CATGGCAATG TGAATCCAAT 1560 

AGTCAATTAC ATAATGACTA TAGAAACACA ACAGTCACCA AATTAAACTA GACTTACTAT 162 0 

TTTAGTGAGT TAAAAATTAC ATACTAAAAG TTTATTGGTA GGTAATAAAT GCTTTTGAGT 1680 

25 AAATAGTGGA AAATGTCTCA TGTTGAGGCT ATGGTTTTGT AGGAACAAGT ACCCTTATTT 1740 

TCAGAGCATC ATGTACTTAA GTATAATGGT CTTGGTAAAG ATAGTTCATA TAAGTTGTAT 1800 

CTAGACAACT GTATCGTCTA AATTGTAAAC AATTATCTAG TACCAATTTT CCCTTTTTAT 1860 

TTTTCAGCAT CAAGAGAAAA CCAATCAGCT TCATCAAAAC AGAAGAAAAA GGCTAAGTCC 1920 

CAGCAATACA AAGGACATAA GAAAAGTGGG TCACCACGTG GTGTTCACAT ACATTTTCTA 1980 

30 ATTGTTAACT AATTGGAGTC ACAGTATTCT TGGACAGAAA ATGATATATC TTGTGAGAAC 2040 

TGATGATTGT GCATTATGTA TTATGCTTAA AGGTGCAGTA TGCCATAAAA GGCAA ACC CT 2100 

TGCAATAATG AGPAACACTG ATATTTTACT AACAGGAGAA ATGATTACCA CAGTATTTAA 2160 

AGTATACGTG GT7VAAGAATA GAGTCTGTCA ATGATTCTTG AAATAATATG TAAAACCTAC 2220 

TGAAAGTTAA TCCTTTTTAA AAACTTTATT TAAAAAGAAA AATTAGCAGC CA6GTGCAGT 2280 

35 GGCTCACGCC TGTAATCCCA GCACTTTAGG AGGCCGAGGC TGGCAGATCA CAAGGTCAGG 2340 

AGATCGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CCACCAAAAA TACAAAAAAT 2400 

CTGCCGGGCG TGGTGGCACA CGCCTGAAGT CCCAGCTACT CAGGAGGCTG AGGCAAGAGA 2460 

ATCACTTGAA CCCAGGAGGC AGAGGTTGCA GTGGGCCAA6 ATCACGCCAC TACATTCCAG 2520 
CTGGGCAACA CAGCAAGACT CTGTCTCAAA AAAAAAAAAA AAAA 

40 

Seq ID HO: 205 Protein aecmence: 
Protein Accession #: NP_060113.1 

1 11 21 31 41 51 

45 I I I I I I 

MGTCWGDISE NVRVEVPNTD CSLPTKVFWI AGIVKLAGYN ALLRYEGPEN DSGLDFWCNI 60 

CGSDIHPVGW CAASGKPIjVP PRTIQHKYTN WKAFLVKRI^T GAKTI»PPDFS QKVSESMQYP 120 

FKPCMRVEW DKRHLCRTRV AWBSVIGGR LRLVYEESED RTDDFWCHMH SPLIHHIGWS 180 

RSIGHRFKRS DITKKQDGHF DTPPHLFAKV KEVDQSGEWF KEGMKIiBAID PLNIiSTICVA 240 

50 TIRKVLADGF LMIGIDGSEA ADGSDWPCYH ATSPSIFPVG FCEINMIELT PPRGYTKIiPF 300 

KWFDYLRETG SIAAPVKLFN KDVPNHGPRV GMKLEAVDLM EPRLICVATV TRIIHRLIiRI 360 
HFDGWEEEYD QWVDCESPDL YPVGWCQLTG YQLQPPASQC KLVYRKGVIiL 



55 

Seq ID NO: 206 DNA sequence 
Nucleic Acid Accession #: NM_012334 

Coding sequence: 223-6399 (underlined sequences correspond to start and stop codons) 

60 1 11 21 31 41 51 

I I I I I I 

GAGACAAAGG CTGCCGTCGG GACGGGCGAG TTAGGGACTT GGGTTTGGGC GAACAAAAGG 60 

TGAGAAGGAC AAGAAGGGAC CGGGCGATGG CAGCAGGGGA GCCCCGCGGG CGCGCGTCCT 120 

CGGGAGTGGC GCCGTGACAC GCATGGTTTC CCCGGACCCG CGGCGGCGCT GACTTCCGCG 180 

65 AGTCGGAGCG GCACTCGGCG AGTCCGGGAC TGCGCTGGAA CAATGGATAA CTTCTTCACC 240 

GAGGGAACAC GGGTCTGGCT GAGAGAAAAT GGCCAGCATT TTCCAAGTAC TGTAAATTCC 300 

TGTGCAGAAG GCATCGTCGT CTTCCGGACA GACTATGGTC AGGTATTCAC TTACAAGCAG 360 

AGCACAATTA CCCACCAGAA GGTGACTGCT ATGCACCCCA CGAACGAGGA GGGCGTGGAT 420 

GACATGGCGT CXTTTGACAGA GCITCATGGC GGCTCCATCA TGTATAACTT ATTCCAGCGG 480 

70 TATAAGAGAA ATCAAATATA TACCTACATC GGCTCCATCC TGGCCTCCGT QAACCCCTAC 540 

CAGCCCATC6 CCGGGCTGTA CGAGCCTGCC ACCATGGAGC AGTACAGCCG GCGCCACCTG 600 

GGCGAGCTGC CCCCGCACAT CTTCGCCATC GCCAACGAGT GCTACCGCTG CCTGTGGAAG 660 

CGCTACXaCA ACCAGTGCAT CCTCATCAGT GGTGAAAGTG GGGCAGGTAA AACCGT^GC 720 

ACTAAATTGA TCCTCAAGTT TCTGTCAGTC ATCAGTCAAC AGTCTTTGGA ATTGTCCTTA 780 

75 AAGGAGAAGA CATCCTGTGT TGAACGAGCT ATTCTTGAAA GCAGCCCCAT CATGGAAGCT 840 

TTCGGCAATG CGAAGACCGT GTACAACAAC AACTCTAGTC GCTTTGGGAA GTTTGTTCAG 900 



275 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



CTGAACATCT 
AAAAACCGAG 
CTGGCAGGGC 
CACTACTTGA 
AG6GAA6TTA 
TCGAGGCTGC 
GCACAGGTTT 
ACACAGCTCA 
ACGCCTCTCA 
GCGTGCTGCT 
TTCAAGTCTA 
GAACAGTTCA 
TTTTCTTTAG 
ATAGACAATG 
AATGAAGAAA 
CAGCATGCGA 
AAGCACTATG 
ACATTTCGAG 
CTTTTTGAAC 
CGGCGGCCTA 
AGCTCCTCTA 
CAGTTTGACC 
AGAATCCGCA 
AAA6TGCTGA 
CTGCAGCTCT 
CGAGAATCCT 
ATGGTGATTC 
TATTGTGTGG 
CACCTGAAAA 
GTTTACAGAC 
GAAGAAAAGA 
GAGCTCCGCG 
AAGAGCCAGA 
GTGGAAGAGA 
CAGGAGCTGT 
CTCCGCAGGC 
TTCGACGAGA 
TTTTCCAGCG 
CCCTACCCAG 
TCCCCCAACC 
GATGACTCTT 
GCG6ACAGCA 
TCCA6CGGC6 
GGCGACTACG 
GTGACCTTCT 
ACCTACAACA 
GATAGTGAAG 
GACTCTGTGT 
GGCCTGATGA 
TTCCGCTCCA 
TCCACGCTGT 
ATGTACTTTG 
AAAGAGATCA 
ACTTTCCACC 
CAGGTCCACG 
CAGAATGCTG 
CCTGATAGAC 
GACACGCCGG 
AGAGTGGAGG 
CC6AAGATGT 
GATTACTACA 
CTCTGCTCTG 
ACCGTGTACG 
CG6T6GTCCA 
CAGCAGCTGA 
TACAAGCGGA 
CTTCCGTATG 
GATGAGGCCA 
CCAATAATCC 
TACTGCCAGC 
TACAGCTGGC 
AAGTATCTCA 
AAATACGCTC 
TCCCGAGATG 
CATGGCGGCG 
GAGAAGCTGA 



GTCA6AAAG6 
TAGTAAGGCA 
TGGAACAT6A 
ATCAGTCTGG 
TTACGGCAAT 
TTGCTGGTAT 
CCTTCAAAAC 
CAGATGCTTT 
ATGTTCAACA 
TTGAGTGGGT 
TTGGCATCCT 
ATATAAACTA 
AACAACTAGA 
GAGAATGCCT 
GCCATTTTCC 
ATAACCACTT 
CTGGAGAGGT 
ATGACCTTCT 
ATGTTTCAAG 
CAGTCAGCTC 
ATCCTTTCTT 
AGGCGGTTGT 
AAGCTGGGTA 
TGAGGAATCT 
ATGATGCCTC 
TGGAACAGAA 
GGGCCCATGT 
TGATAATACA 
AGGCAGCCAT 
AATTGCTGGC 
AGAAACGG6A 
CCCAGCAGGA 
AGGAAGCTGA 
TCCTCCGTCT 
CGCTGACCGA 
TGGAGGAGGA 
TCGACGAGTG 
AGCTGGCTGA 
AGGAGGAGGT 
CCAGCGAGCA 
CAGAGGAGGA 
CGGTGCTGCT 
AGTCCACCTA 
ACTAC6ACCA 
CCAACTCCTA 
GCTCGGGTGC 
AGGACTTTGA 
ACAGCTGTGT 
ACTCTTGGAA 
AGCAGGAGGC 
CCAGGAGAAA 
AAAACGACAG 
TAGATAACAC 
TGATTGCAGA 
CGTCCACGGA 
TGGGCACCTT 
CCAACTCGTT 
AGGAGATGCA 
GCCAGGAATT 
CTTCACTGAA 
AGAGTTCAGA 
TCX3TCCCCCC 
GGCGCAAGCA 
GTGCCATTCA 
TTCAAGATAT 
ACCCGATCCT 
GGGACATAAA 
TCAAGATATT 
AGGGCATCCT 
TTATCAAACA 
AGATCCTGAC 
AGTTCCATCT 
TCTTCACTTA 
AAATAGAAGC 
GCTCCTGCAA 
TCCGAGGCCT 



AAATATTCAG 
AAATCCCGG6 
AGAAAGAGAA 
ATGTGTAGAA 
GGACGT6ATG 
ACTGCATCTT 
AGCTTTGGGC 
GACCCAGAGA 
GGCAGTAGAC 
AATCAAGAAG 
CGACATCTTT 
TGCAAACGAG 
ATATAGCCGG 
GGACTTGATT 
TCAAGCCACA 
TTATGTGAAG 
GCAATATGAT 
CAATTTGCTA 
CCGCAACAAC 
ACAGTTCAAG 
TGTTCGCTGT 
GCTGAACCAG 
TGCGGTCCGA 
GGCTCTGCCT 
CAACAGCGAG 
ACTGGAGAAG 
CTTGGGCTTC 
GAAGAATTAC 
AGTTTTCCAG 
AGAGAAAAGG 
GGAAGAAGAA 
AGAAGAAAGG 
ACTGACCCGT 
GGAGAAAGAA 
GGCTTCCCTG 
AGCGTGCAGG 
TGTCCGGAAT 
GAGCGCATGC 
CGATGAGGGC 
CGGCCACTCA 
CCCATACATG 
CX3CCCCATCA 
CTGCATGCCC 
GGATGACTAT 
C6GCAGCCAG 
CTACCGGTTC 
rrCCAGGTTT 
CACTCTGCCG 
ACGCCGCTG6 
CCTCAAGCAA 
TTGGAAGAAG 
CGAGGAGAAG 
CACCAAGGAG 
GTCCCCAGAA 
CCAGGAGATC 
GGATGTGGGG 
TGTGATCATC 
CCACTGGATA 
CATCGTGAGA 
ACTGAAGAAA 
GAAGAACGCG 
AGATGAGAAG 
CTGTTACCGG 
AAACGTGACT 
CAAGGAGAAC 
TCGATACACC 
TCTCAACTTG 
CAATTCCCTG 
ACAGACAGGG 
GACCAACAAA 
ATGCCTGAGC 
GAAAAGGATA 
CGAATCTCTT 
TCTGATCCAC 
GATCACCATC 
GGCCATGGAG 



GGCGGGAGAA 
GAAAGGAATT 
GAATTTTATT 
GACAAGACAA 
CAGTTCAGCA 
GGGAACATAG 
AGATCT6C6G 
TCAATGTTCC 
AGCAGG6ACT 
ATCAACAGCA 
GGATTTGAAA 
AAACTTCAGG 
GAAGGATTAG 
GAGAAGAAAC 
GACAGCACCT 
CCCAGAGTTG 
GTCCGAGGTA 
AGAGAAAGCC 
CAGGATACCT 
GACTCACTGC 
ATCAAGCCAA 
CTGCGGTACT 
AGACCCTTTC 
GAGGACGTCC 
TGGCAGCTGG 
CGGAGG6AAG 
TTAGCACGAA 
AGAGCATTCC 
AAGCAACTCA 
GAGCAAGAAG 
AGAGAAAGAG 
AGGAAGCAGC 
GAACTGGAGA 
ATCGAGGACC 
CAGAAGCTGC 
GCGGCCCAGG 
ATCGAGCGGT 
GAGGAGAAGC 
TTCGAAGCCG 
GACCAGCGAA 
AACGACACGG 
GTGCAGGACT 
CAGAACGCTG 
GAGGAC6GTG 
TGGTCCCCCG 
AGCTCTGAGG 
GATACAGATG 
TATTTCCACA 
TGCQTCCTCA 
GGCTGGCTCC 
CGCTGGTTTG 
CTCAAGGGCA 
AATGGGATCX3 
GATGCCAGCC 
CAGGAGATGC 
CTGATTGATT 
ACGGCCAACC 
ACCCTGCTGC 
GGATGGTTGC 
CGGTGGTTTG 
CTCAAACTGG 
ATATTCAAAG 
CTCTACACCA 
GACACCAAG6 
TGCCTGAACT 
CATCACCCCT 
CTCAAAGACA 
CAGCAACTGG 
CATGACCTGC 
GTGCCCCACC 
TGCACCTTCC 
CGGGAACAGT 
AAGAAAACCA 
AGGCAGGAAA 
AACTCCCACA 
GACAGCAGGA 



TTGTAGATTA 
ATCACATATT 
TATCTACXSCC 
TCAGTGACCA 
AGGAGGAAGT 
AATTTATCAC 
AGTTACTTGG 
TCAGGGGAGA 
CCCTGGCCAT 
GGATCAAAGG 
ACTTTGAGGT 
AGTACTTCAA 
TGTGGGAAGA 
TTGGCCTCCT 
TATTGGAGAA 
CAGTTAACAA 
TCTTGGAGAA 
GATTTGACTT 
TGAAATGTGG 
ATTCCTTAAT 
ACATGCAGAA 
CAGGGATGCT 
AGGACTTTTA 
GAGGGAAGTG 
GGAAGACCAA 
AGGAAGTGAG 
AACAATACAG 
TTCTGAGGAG 
GAGGTCAGAT 
AAAAGAAGAA 
AGA6;VGAGCX3 
AAGAACTCGA 
AACAGAAGGA 
TGCAGCGCAT 
AGGAGCGGCG 
AGTTCCTCGA 
CCCTGTCGGT 
CCAACTTCAA 
ACGACGACGC 
CAAGTGGCAT 
TGGTGCCCAC 
CCGGGAGCCT 
GGGACTTGCC 
CCATCACTTC 
ACTACCGCTG 
GGGC6CAGTC 
ATGAGCTTTC 
GCTTTCTGTA ■ 
AGGATGAAAC 
ACAAAAAAGG 
TCCTCCGCCA 
CCGTAGAAGT 
ACATCATTAT 
AGTGGTTCAG 
ATGATGAGCA 
CTGTGTGTGC 
GGGTGCTGCA 
AGAGGTCCAA 
ACAAAGAGGT 
TACTCACCCA 
GGACCCTGGT 
AGACAGGCTA 
AGCTGCTCAA 
CCCCGATCGA 
CGGATGTGGT 
TGCACTCCCC 
AAGGCTATAC 
AGTCCATGTC 
GACCTCTGCG 
CCGGCA6TGT 
TGCCGA6TCG 
TTCCAGGAAC 
AAT6CCGAGA 
TGACATCCAC 
CCACTGCTGG 
ACATGTTTGC 



TTTATTAGAA 
TTATGCACTG 
AGA/^AACTAC 
GGAATCCTTT 
TCGGGAAGTG 
TGCTGGTGGG 
GCTGGACCCA 
AGAGATCCTC 
G6CTCTGTAT 
CAATGAGGAC 
TAATCACTTT 
CAAGCATATT 
TATTGACTGG 
AGCCCTTATC 
GCTACACAGT 
TTTTGGAGTG 
GAACAGAGAT 
TATCTACGAT 
AAGCAAACAT 
GGCAACGCTA 
GATGCCAGAC 
GGAGACTGTG 
CAAAAGGTAT 
CACGAGCCTG 
GGTCTTTCTT 
CCACGCGGCC 
AAAGGTCCTT 
GAGATTTTTG 
T6CTCGGAGA 
ACAG6AAGAG 
AAGAGAAGCC 
AGCCTTGCAG 
AAATAAGCAG 
GAAGGAGCAG 
GGACCAGGAG 
GTCCCTCAAT 
GGGAAGCGAA 
CTTCAGCCAG 
CTTCAAGGAC 
CCGGACCAGC 
CAGCCCCAGT 
ACACAACTCC 
CTCCCCAGAC 
CGGCAGCAGC 
CTCTGTGGGG 
CTCGTTTGAA 
ATACCGQCGT 
CATGAAAG6T 
CTTCTTGTGG 
GGGGGGCTCC 
GTCCAAGCTG 
GCGAACGGCA 
GGCC6ATAGG 
CGTGCTGAGT 
GGCAAACCCA 
CTCTGACAGC 
CTGCAACGCC 
AGGGGACACC 
GAAGAACAGT 
CAATTCCCTG 
CCTCAACAGC 
CTGGAACGTC 
CGAGGCCACC 
CACCCCCACC 
GGAACAGATT 
GCTCCTGCCC 
CACCCTTCAG 
TGACCXa\ATT 
GGACGAGCTG 
GG6CAACCTG 
AGGGATTCTC 
C6AGATG6AA 
GTTTGTGCCT 
GGTCTATTGC 
GGAGGTGGTG 
TTTGTTTGAA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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1560 
1620 
1680 
1740 
1600 
1860 
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2100 
2160 
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2280 
2340 
2400 
2460 
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2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4360 
4440 
4500 
4S60 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 



276 

NSOOCIO: <WO 02079492A2 l_> 



wo 02/079492 



PCT/US02/04915 



TACAACGGCC ACGTCGACAA AGCCATTGAA AGTCGAACCG TCGTAGCTGA TGTCTTAGCC 
AAGTTTGAAA AGCTGGCTGC CACATCCGA6 GTTGGGGACC TGCCATGGAA ATTCTACTTC 
AAACTTTACT GCTTCCTGGA CACAGACAAC GTGCCAAAAG ACAGTGTGGA GTTTGCATTT 
ATGTTTGAAC AGGCCCACGA AGCGGTTATC CATGGCCACC ATCCAGCCCC GGAAGAAAAC 
5 CTCCAGGTTC TTGCTGCCCT GCGACTCCAG TATCTGCAGG GGGATTATAC TCTGCACGCT 
GCCATCCCAC CTCTCGAAGA GGTTTATTCC CTGCAGAGAC TCAAGGCCCG CATCAGCCAG 
TCAACCAAAA CCTTCACCCC TTGTGAACGG CTGGAGAAGA GGCGGACGAG CTTCCTAGAG 
GGGACCCTGA GGCGGAGCTT CCGGACAGGA TCCGTGGTCC GGCAGAAGGT CGAGGAGGAG 
CAGATGCTGG ACATGTGGAT TAAGGAAGAA GTCTCCTCTG CTCGAGCCAG TATCAT TGA C 

10 AAGTGGAGGA AATTTCAGGG AATGAACCAG GAACAGGCCA TGGCCAAGTA CATGGCCTTG 
ATCAAGGAGT GGCCTGGCTA TGGCTCGACG CTGTTTGATG TGGAGTGCAA GGAAG6TGGC 
TTCCCTCAGG AACTCTGGTT GGGTGTCAGC GCGGAC6CCG TCTCC6TCTA CAAGCGTGGA 
GAGGGAA6AC CACTG6AAGT CTTCCAGTAT 6AACACATCC TCTCTTTTGG GGCACXICCTG 
GCGAATACGT ATAAGATCGT GGTCGATGAG AGGGAGCTGC TCTTTGAAAC CAGTGAGGTG 

15 GTGGATGTGG CCAAGCTCAT GAAAGCCTAC ATCAGCAT6A TCGTGAAGAA GCGCTACAGC 
ACGACACGCT CCGCCAGCAG CCAGGGCAGC TCCAGGTGAA GGCGGGACAG AGCCCACCTG 
TCTTTGCTAC CTGAACGCAC CACCCTCTGG CCTAQGCTGG CTCCAGTGT6 CCATGCCCAG 
CCAAAACAAA CACAGAGCTG CCCAGGCTTT CTGGAAGCTT CTGGTCTGAG GGAGGTGTCT 
CCGAGOATCC TTTTGCCTGC CGCCTTCATT GATCCTGTAT TAAGCTGTCA ACTTTAACAG 

20 TCTGCACAGT TTCCAAAGCT TTACTACTCT TAGAGGACAC ATGCCTTAAA AAAGGAGGGG 
AGGAACCACG CTGCCACCAA AGCAGCCGGA AGTGCCTTAA CTTGTGGAAC CAACACTAAT 
CGACCGTAAC TGTGCTACTG AAGGGAACTG CCTTTCCCCC TTCTGGGGGA GACTTAACAG 
AGCGTGGAAG GGGGGCATTC TCTGTCAATG ATGCACTAAC CTCCCAACCT GATTTCCCCG 
AATCTGAGGG AAGGTGAGGG AGTGGGAAGG GGGATGGAGA GCTCGAGGGG ACAGTGTGTT 

25 TGAGCTGGAG TGCTGCGGGC AGCCTTTCTC ATGGAATGAC ATGAATCAAC TTTTTTCTTT 
GTTTCATCTT TTAAGTGTAC CTGCTTTGCCT GTTCGTGCAT GTGTTCATAA ACTCAACACT 
TTAATCATGG TTTCATGAGC ATTAAAAAGC AAAGGGAAAA AGGATGTGTA ATGGTGTACA 
CAGTCTGTAT ATTTTAATAA TGCAGAGCTA TAGTCTCAAT TGTTACTTTA TAAGGTGGTT 
TTATTAACAA ACCCAAATCC TGGATTTTCC TGTCTTT6CT GTATTTTGAA AAACACGTGT 

30 TGACTCCATT GTTTTACATG TAGCAAAGTC TGCCATCTGT GTCTGCTGTA TTATAAACAG 
ATAAGCAGCC TACAAGATAA CTGTATTTAT l^AACCACTCT TCAACAGCTG GCTCCAGTGC 
TGGTTTTAGA ACAAGAATGA AGTCATTTTG GAGTCTTTCA TGTCTAAAAG ATTTAAGTTA 
AAAACAAA6T GTTACTTGGA AGGTTAGCTT CTATCATTCT G6ATAGATTA CAGATATAAT 
AACCATGTTG ACTATGGGGG AGAGACGCTG CATTCCAGAA ACGTCTTAAC ACTTGAGTGA 

35 ATCTTCAAAG GACCCTGACA TTAAATGCTG AGGCTTTAAT ACACACATAT TTTATCCCAA 
GTTTATAATG GTGGTCTGAA CAAGGCACCT 6TAAATAAAT CAGCATTTAT GACCAGAAGA 
AAAATAATCT GGTCTTGGAC TTTTTATTTT TATATGGAAA AGTTTTAAGG ACTTGGGCCA 
ACTAAGTCTA CCCACACGAA AAAAGAAATT TGCCTTGTCC CTTTGTGTAC AACCATGCAA 
AACTGTTTGT TGGCTCACAG AAGTTCTGAC AATAAAAGAT ACTAGCT 

40 

Seq ID NO: 207 Protein sequence; 
Protein Accession #: NP_036466 

1 XI 21 31 41 51 

45 I I I I I I 

MDNFFTEGTR VWLRENGQHF PSTVNSCAEG IWFRTDYGQ VFTYKQSTIT HQKVTAMHPT 60 
NEEGVDDMAS LTELHGGSIM YNLFQRYKRN QIYTYIGSII, ASVNPYQPIA GIiYEPATMEQ 120 
YSRRHLGEI>P PHIFAIANBC YRCIjWKRYDN QCILISGBSG AGRTBSTKIil LKFIiSVISQQ 180 
SIiELSLKEKT SCVERAILES SPIMEAFGNA KTVYNNNSSR FGKFVQIjNIC QKGNIQGGRI 240 

50 VDYIiIiEKNRV VRQNPGERNY HIFYALIiAGL EHBEREBFYL STPBNYHYIiM QSGCVBDKTI 300 
SDQESFREVI TAMDVMQFSK BBVRBVSRUj AGIIiHIiGNIE FITAGGAQVS PKTALGRSAE 360 
IjLGIiDPTQIjT DAliTQRSMFL RGEEIIiTPUff VQQAVDSRDS LAMALYACCF EWVIKKINSR 420 
IKGNEDFKSI GILDIFGFEN FEVNHPEQFN INYANEKLQE yPNKHIPSLE QIiEYSREGIjV 480 
WEDIDWIDNG BCIiDLIEKKIj GLIiALINEES HFPQATDSTL LEKLHSQHAN NHFYVKPRVA 54 0 

55 VNNFGVKHYA GEVQYDVRGI LEKNRDTFRD DLIiNLIiRESR FDFIYDLFEH VSSRNNQDTIi 600 
KCX3SKHRRPT VSSQFKDSLH SliMATLSSSN PFFVRCIKPN MQKMPDQFDQ AWLNQLRYS 660 
GMLETVRIRK AGYAVRRPFQ DFYKRYKVIW RNliALPEDVR GKCTSLLQIiY DASNSEWQLG 720 
KTKVFLRESL EQKIiEKRREE EVSHAAMVIR AHVIiGFIiARK QYRKVLYCW IIQKNYRAFL 780 
LRRRFIiHIiKK AAIVFQKQLR GQIARRVYRQ liliAEKREQEE KKKQEEEEKK KREEEERERE 84 0 

60 RERREAEIiRA QQEEETRKQQ ELEAIjQKSQK EAEIiTRELEK QKENKQVEEI IjRIjEKEIEDIj 900 
QRMKEQQELS liTEASLQKIiQ ERRDQELRRL EEEACRAAQE FLESLNFOEI DECVRNIERS 960 
LSVGSEFSSE LAESACBEKP NFNFSQPYPE EEVDEGFEAD DDAFKDSPNP SEHGHSDQRT 1020 
SGIRTSDDSS EEDPYMNDTV VPTSPSADST VLIiAPSVQDS GSLHNSSSGE STYCMPQNAG 108 0 
DLPSPDGDYD YDQDDYEDGA ITSGSSVTFS NSYGSQWSPD YRCSVGTYNS SGAYRFSSEG 1140 

65 AQSSFEDSEE DFDSRFDTDD BLSYRRDSVY SCVTIiPYFHS FliYMKGGLMN SWKRRWCVIiK 1200 
DETFIiWFRSK QBALKQGWLH KKGGGSSTLS RRNWKKRWPV LROSKdMYFE NDSEEKIiKGT 1260 
VEVRTAKEII DNTTKENGID IIMADRTPHL lABSPEDASQ WPSVIiSQVHA STDQEIQBMH 1320 
DEQANPQNAV GTIiDVGLIDS VCASDSPDRP NSFVIITANR VUlCNAOTPB EMHHWlTLIiQ 1380 
RSKGDTRVEG QEFIVRGWLH KBVKNSPKMS SIjKIiKKRWFV LTHNSiDYYK SSEKNAUCUG 1440 

70 TLVLNSIiCSV VPPDEKIFKB TGYWNVTVYG RKHCYRLYTK IiUIEATRWSS AIQNVTDTKA 1500 
PIDTPTQQLI QDIKENCLNS DWEQIYKRN PIIjRYTHHPL HSPLliPIiPYG DINLNLLKDR 1560 
GYTTLQDBAI KIFNSIiQQLE SMSDPIPIIQ GIIiCJTGHDIjR PLRDEIiYCQL IKQTNKVPHP 1620 
GSVGNIiYSWQ ILTCLSCTPL PSRGILKYLK FHUCRIREQF PGTEMEKYAL FTYESLKKTK 1680 
CREFVPSRDB lEALIHRQEM TSTVYCHGGG SCKITINSHT TAGEWEKLI RGLAMEDSRN 1740 

75 MFAIiFEYNGH VDKAIESRTV VADVIAKFEK IiAATSEVGDL PWKFYFKIiYC FLDTDNVPKD 1800 
SVEFAFMFEQ AHBAVIHGHH PAPBBNLQVL AALRLQYIiQG DYTLHAAIPP IjBBVYSLQRIi 1860 



277 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 



wo 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



KARISQSTKT FTPCERLEKR RTSFLEGTIiR RSFRTGSWR QKVEEEQMLD MWIKEEVSSA 1920 

RASIIDKWRK FQGMNQEQAM AKYMALIKEW PGYGSTIiFDV ECKEGGFPQE LWLGVSADAV 1980 

SVYKRGEGRP LEVFQYEHII, SFGAPLANTY KIWDERELL PETSEWDVA KLMKAYISMI 2040 

ViCKRYSTTRS ASSQGSSR r^xAowx -iu«a 

Seq ID NO: 208 DWA aeouence 

Nucleic Acid Accession #: XM_05976l.i 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 



CGAAGATCTA 
GATGATCTCA 
GGCATGGCTC 
TCTCTGAGCG 
GATTCTGTAA 
TCAAATACCC 
GTGAGAAGTT 
GGCTGCCGTC 
ATTTTCTGTT 
CTCTGTGTAA 
TACTGCTTCT 
ACAACAGGTC 
TCAGAATGAA 
TGGAGGTGTT 



11 
I 

TCCAAAATCA 
ATCATGTGGA 
TTATGGAAGT 
AGACA6TGAA 
ATGAAACCCA 
AAGATGCTTC 
ACAACTCTGA 
CTTGTGAGGA 
TCAAGCTTCT 
CACTAACATT 
ATTTTGAAAA 
CTAGCATGTA 
TGCAGTTGTG 
TGTTTTCTCC 



21 
I 

AGAAGCCTTT 
TTTGAATGTG 
TAACCTATTA 
GAAAGTGGAA 
GTTTTGTGTT 
AGTGTCCATA 
AGTGAAGCTG 
TGGAGCTTCA 
GTACTTTATG 
TCCAGTAGTC 
AAGAGTTTTT 
TAGCTGCATA 
TGTCTATATT 
AGAATAAAG6 



31 
I 

GATTTAGATG 
TGTACAAGCT 
A6TGGCTTTA 
TATGATCATG 
AATATTCCTG 
GTGGATTACT 
TCCTCCTGTG 
GGCTCCCATC 
GAACTTTGGC 
ACATGTGATT 
TTTCTTTCTA 
GATTTCTTCA 
TTCCCCTCTC 
TATTACTTTA 



41 
I 

TTGCTGTAAA 
TTTCGGGCCC 
TGGTGCCTTC 
GAAAACTCAA 
CTGTGAGAAA 
AT6AGCCAAG 
ACCTTTGCAG 
ATCACTCTTC 
T GTGA TTTAT 
GTTTTGTTTT 
TGGGGTTGCA 
CCTGATCTTT 
AAAATCTTTT 
G 



51 
I 

AGAAAATAAA 
GGGTAGGAGT 
AGAAGCAATT 
CCTCTATTTA 
CTTTAAAGTT 
GAGACAGGCG 
TGATGTCCAG 
AGTCATTTTT 
TTTTAAAGGA 
CGTAGAAGAA 
GGGATGGTGT 
GTGTGGAAGA 
AGAATTTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
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780 
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Seq ID NO: 209 Protein seqnence! 
Protein Accession #: XP 059761.1 



11 



21 



41 



51 



31 

I I I I I I 

MAIiMEVNLLS GFMVPSEAIS LSBTVKKVEY DHGKIiNIiYLD SVNETQFCVN IPAVRNFKVS 
NTQD7VSVSIV DYYEPRRQAV RSYNSBVKLS SCDIjCSDVQG CRPCBDGASG SHHHSSVIFI 
FCFKLLYFME LWL 
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Seq ID NO; 210 DNA sequence 
Nucleic Acid Accession #: NM_015472 

Coding sequence: 258-1460 (underlined sequences correspond to start and stop codons) 



1 
I 

GACACACTCC 
AGCTCACATC 
AGAAGACATG 
GACTCTTATC 
TG6TGATTTT 
AGCAAGTGAT 
TCATGAATCC 
AGCCTGATTC 
GGCCTCGACT 
AGCTGGGCAC 
AGCAGTCCTA 
CGGCCACTGG 
CTAGGAAGGC 
CACCAGTGCC 
ACCAGCAGCA 
CCGCAGGGCT 
TGCGGCTTCA 
TGAGGCAGGA 
TTCAGGCTGC 
GCTCAGATCC 
GCCTGGGGTT 
ATGAGATGGA 
CCCGTTTCCC 
AATCTGAACSA 
CCTTTCTAAC 
TTCCTGGGCC 
GCCrrCCATGA 
ATT6AGAGTA 



11 
I 

TCTACAACAC 
CTGGCGACTC 
TACATTCAGT 
TTGGCATCTG 
CCAGAA GATG 
CCACGTCACG 
GAAGCCTAGC 
GGGCTCGCAC 
GGCTGGGG6T 
CGGC6CGGGT 
CGACGTGACC 
CCAGAGGTAC 
GATGAATCAG 
TCAGAGGTCC 
GATGGCCCCC 
CATGAGTATG 
GAGAATCCAG 
AGCTGCCCTC 
TGTCAACCCA 
TTTCCTCAAT 
AGGGTGCTAC 
TACAGGAGAA 
TGATTTCCTT 
CCT6ATCCCC 
(TrGGCTGTAA 
TCTT6GAAAA 
CTCGTGCTCC 
ACTTAAGTTA 



21 

I 

CAGAGACTCC 

TCAAGAGAGA 

ATCTATTTTG 

CTTCCTGGTG 

AATCCGGCCT 

CAGGACCTAG 

TCGTGGCGGA 

TCGCGCCAGT 

GCCCAGCATG 

GCTGCGG6TA 

GACGAGCTGC 

TTCCTCAATC 

CCTCTGAATC 

ATGGCAGTAT 

AGTACCCTGA 

CCCAATGCGC 

ATGGAGAGAG 

TGTCGACAGC 

CCCACGATGA 

GGAGGGCCAT 

AGTGTCCCCA 

AACGCAGGAC 

GACTGTCTTC 

CTCTTCAATG 

TCACTACCAT 

AGTGATGGAG 

CTCCTTTTTA 

AACATAAATA 



31 
I 

CAAACACAAG 

AACCTCAGAG 

GCATTTTCCC 

GATCTGAACT 

CGGCGCCCCC 

ACACAGACCT 

AGAAGATCCT 

CCA6CACCGA 

TCCGCTCGCA 

GCCCCGCGCA 

CACTGCCCCX: 

ACATAGAAAA 

ATATGAACCT 

CCCAGCCAAA 

GCCAGCAGAA 

TGACCACTCA 

AAAGGATTCG 

TCCCCATGGA 

CCCCAGACAT 

ATCATTCGAG 

CAACTCCGGA 

AAACACCCAT 

CAGGAACAAA 

ATGTAGAGTC 

TGTAACTTGG 

CAGA6CAAGT 

TGTTGCCAGT 

AATATTCTAT 



41 

I 

GCCTTATATT 
TGACTATVAAT 
CAATACATCT 
GACCCATAAG 
TCCGCTCCCG 
CGAAGCCCTC 
GCCGGAGTCT 
CTCGTCGGGC 

crrcGTCGCcc 

GCAGCAC606 
6GGCTGGGAG 
AATCACCACA 
CCACCCTGCC 
TCTCGTQATG 
CCACCCCACT 
GCAGCAGCAG 
AATGCGCCAA 
AGCTGAGACT 
GAGATCCATC 
GGAGCAGAGC 
GGACTTCCTC 
GAACATCAAT 
CGTTGACTTA 
TGCTCTGAAC 
ATGTAGCCAT 
CTGCAGGTGC 
TTAATCATTG 
TTTCATTTTC 



51 

I 

GACTCATTTC 

CTCCATAATG 

CTGCTCATCT 

CCACGCTTAC 

CCGCCTGGGC 

TTCAACTCTG 

TTCTTTAAGG 

GGCCACCCGG 

GCGTCCCTGC 

CACCTCCGCC 

ATGACCTTCA 

T<3GCAAGACC 

GTCAGTTCCA 

AATCACCAAC 

CAGAACCCAC 

CAGCAQAAAC 

GAGGAGCTCA 

CTTGCCCCAG 

ACTAATAATA 

ACTGACAGTG 

AGCAATGTGG 

CCCCAACAGA 

GGAACTTTGG 

AAAAGTGAGC 

GACCTTACAT 

ACCACTTCCC 

CCTGGTTTTG 



Seq ID NO: 211 Protein sequence; 
Protein Accession #: NP_056287.1 



60 
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MNPASAPPPL PPPGQQVIHV TQDLDTDIiEA LFNSVMNPKP SSWRKKILPE SPFKEPDSGS 60 

HSRQSSTDSS GGHPGPRLAG GAQHVRSHSS PASIiQLGTGA GAAGSPAQQH AHLRQQSYDV 120 

TDELPLPPGW EMTFTATGQR .YFLNHISKIT TWQDPRKAMN QPLNHMNLHP AVSSTPVPQR 180 

SMAVSQPNLV MNHQHQQQMA PSTLSQQNHP TQNPPAGLMS MPNAIjTTQQQ QQQKLRLQRI 240 

5 QMERERIRMR QEELMRQEAA LCRQLPMEAE TLAPVQAAVN PPTMTPDMRS ITNNSSDPFIi 300 

NGGPYHSREQ STDSGLGLGC YSVPTTPEDF LSNVDEMDTG ENAGQTPMNI NPQQTRPPDF 360 
LDCLPGTNVD liGTLESEDLI PLFNDVESAIi NKSEPFLTWL 

Seq ID NO: 212 DNA seoruence 
10 Nucleic Acid Accession #: NM 018174 

Coding sequence: 176-2194 (underlined sequences correspond to start and stop codons) 



CATCTCCCCC AACCTGGGGG TCGTGTTCTT CAACGCCTGC GAGGCCGCGT CGCGGCTGGC 60 

15 GCGCGGCGAG GATGAGGCGG AGCTGGCGCT GAGCCTCCT6 GCGCAGCTGG GCATCACGCC 120 

TCTGCCACTC AGCCGCGGCC CC6TGCCAGC CAAACCCACC GT6CTCTTCG AGAAG ATGG G 180 

CGTGGGCCGG CTGGACATGT ATGTGCTGCA CCCGCCCTCC GCCGGCGCCG AGCGCACGCT 240 

GGCCTCTGTG TGCGCCCTGC TGGTGTGGCA CCCCGCCGGC CCCGGCX3AGA AGGTGGTGCG 300 

CGTGCTGTTC CCCGGTTGCA CCCCGCCGGC CTGCCTCCTG GACGGCCTGG TCCGCCTGCA 360 

20 GCACTTGAGG TTCCTGCGAG AGCCCGTGGT GACGCCCCAG GACCTGGAGG GGCCGGGGCG 420 

AGCCGAGAGC AAAGAGAGCG TGGGCTCCCG GGACAGCTCG AAGAGAGAGG GCCTCCTGGC 480 

CACCCACCCT AGACCTGGCC AGGAGCGCCC TGGGGTGGCC CGCAAGGAGC CAGCACGGGC 540 

TGAGGCCCCA CGCAAGACTG AGAAAGAAGC CAAGACCCCC CGGGAGTTGA AGAAAGACCC 600 

CAAACCGAGT GTCTCCCGGA CCCAGCCGCG GGAGGTGCGC CGGGCAGCCT CTTCTGTGCC 660 

25 CAACCTCAAG AAGACGAATG CCCAGGCGGC ACCCAAGCCC CGCAAAGCGC CCAGCACGTC 720 

CCACTCTGGC TTCCCGCCGG TGGCAAATGG ACCCCGCAGC CCGCCCAGCC TCCGATGTGG 780 

AGAAGCCAGC CCCCCCAGTG CAGCCTGCGG CTCTCCGGCC TCCCAGCTGG TGGCCACGCC 840 

CAGCCTGGAG CTGGGGCCGA TCCCAGCCGG GGAGGAGAAG GCACTGGAGC TGCCTTTGGC 900 

CGCCAGCTCA ATCCCAAGGC CACGCACACC CTCCCCTGAG TCCCACCGGA GCCCCGCAGA 960 

30 GGGCAGCGAG CGGCTGTCGC TGAGCCCACT GCGGGGCGGG GAGGCCGGGC CAGACGCCTC 1020 

ACCCACAGTG ACCACACCCA CGGTGACCAC GCCCTCACTA CCCGCAGAGG TGGGCTCCCC 1080 

GCACTCGACC GAGGTGGACG AGTCCCTGTC GGTGTCCTTT GAGCAGGTGC TGCCGCCATC 1140 

CGCCCCCACC AGTGAGGCTG GGCTGAGCCT CCCGCTGCGT GGCCCCCGGG CGCGGCGCTC 1200 

GGCTTCCCCA CACGATGTGG ACCTGTGCCT GGTGTCACCC TGTGAATTTG AGCATCGCAA 1260 

35 GGCGGTGCCA ATGGCACCGG CACCTGCGTC CCCCGGCAGC TCGAATGACA GCA6TGCCCG 1320 

GTCACA6GAA CGGGCAGGTG GGCTGGGGGC CGAGGA6ACG CCACCCACAT CGGTCA6CGA 1380 

GTCCCTGCCC ACCCTGTCTG ACTCGGATCC CGTGCCCCTG GCCCCOGGTG CGGCAGACTC 1440 

AGACGAAGAC ACAGAGGGCT TTGGAGTCCC TCGCCACGAC CCTTTGCCTG ACCCCCTCAA 1500 

GGTCCCCCCA CCACTGCCTG ACCCATCCAG CATCTGCATG GTGGACCCCG AGATGCTGCC 1560 

40 CCCCAAGACA GCACGGCAAA CGGAGAACGT CAGCCGCACC CGGAAGCCCC TGGCCCGCCC 1620 

CAACTCACGC GCTGCCGCCC CCAAAGCCAC TCCAGTGGCT GCTGCCAAAA CCTU^GGGGCT 1680 

TGCTGGTGGG GACCGTGCCA GCCGACCACT CAGTGCCCGG AGTGAGCCCA GTGAGAAGGG 174 0 

AGGCCGGGCA CCCCTGTCCA GAAAGTCCTC AACCCCCAAG ACTGCCACTC GAGGCCCGTC 1800 

GGGGTCAGCC AGCAGCCGGC CCGGGGTGTC AGCCACCCCA CCCAAGTCCC CGGTCTACCT 1860 

45 GGACCTGGCC TACCTGCCCA GCGGGAGCAG CGCCCACCTG GTGGATGAGG AGTTCTTCCA 1920 

GCGCGTGCGC GCGCTCTGCT ACGTCATCAG TGGCCAGGAC CAGCGCAAGG AGGAAGGCAT 1980 

GCGGGCCGTC CTGGACGCGC TACTGGCCAG CAAGCAGCAT TGGGACCGTG ACCTGCAGGT 2040 

GACCCTGATC CCCACTTTCG ACTCGGTGGC CATGCATACG TGGTACGCAG AGACGCACGC 2100 

CCGGCACCAG GCGCTGGGCA TCACGGTGTT GGGCAGCAAC GGCATGGTGT CCATGCAGGA 2160 

50 TGACGCCTTC CCGGCCTGCA AGGTGGAGTT CTAGCCCCAT CGCCGACACG CCCCCCACTC 2220 
AGCCCAGCCC GCCTGTCCCT AGATTCAGCC ACATCAGAAA TAAACTGTGA CTACACTTG 

Seq XD NO: 213 Protein sequence: 
Protein Accession #: NP 060644.1 

55 

MGVGRIjDMYV LHPPSAGAER TIASVCA1.lv WHPAGPGEKV VRVIiFPGCTP PACLliDGIiVR 60 

LQHIiRFLREP WTPQDLEGP GRAESKESVG SRDSSKREGL LATHPRPGQE RPGVARKEPA 120 

RAEAPRKTEK EAKTPRELKK DPKPSVSRTQ PREVRRAASS VPNLKKTNAQ AAPKPRKAPS 180 

TSHSGFPPVA NGPRSPPSLR CGBASPPSAA CGSPASQLVA TPSLELGPIP AGEEKAIiELP 240 

60 3LAASSIPRPR TPSPESHRSP AEGSERI^SLS PIiRGGEAGPD ASPTVTTPTV TTPSLPAEVG 300 

SPHSTEVDES LSVSFEQVLP PSAPTSEAGL SLPLRGPRAR RSASPHDVDL CLVSPCEFEH 360 

RKAVPMAPAP ASPGSSNDSS ARSQERAGGL GAEETPPTSV SESLPTLSDS DPVPLAPGAA 420 

DSDEDTEGFG VPRHDPLPDP LKVPPPLPDP SSICMVDPEM LPPKTARQTE NVSRTRKPIjA 480 

RPNSRAAAPK ATPVAAAKTK GliAGGDRASR' PLSARSEPSE KGGRAPLSRK SSTPKTATRG 540 

65 PSGSASSRPG VSATPPKSPV YliDLAYLPSG SSAHIiVDEEF FQRVRALCYV ISGQDQRKEE 600 

GMRAVIiDAIi ASKQHWDRDL QVTIiIPTFDS VAMHTWYABT HARHQALGIT VIX3SNGMVSM 660 
QDDAFPACKV EP 

Seq ID NO: 214 DWA seq^ienc^ 
7U Nucleic Acid Accession #: NM_002019.1 

Coding sequence: 250-4266 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

75 GCGGACACTC CTCTCGGCTC CTCCCCGGCA GCGGCGGCGG CTCGGAGCGG GCTCCGGGGC 60 

TCGGGTGCAG C6GCCAGC6G GCCT6GCGGC GAGGATTACC CGGGGAAGT6 GTTGTCTCCT 120 
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GGCTGGAGCC GCGAGACGGG CGCTCAGGGC GCGGGGCCGG CGGCGGCGAA CGAGAGGACG IBO 
GACTCTGGCX3 GCCGGGTCGT TGGCCGGGGG AGCGCGGGCA CCGGGCX5AGC AGGCCGCGTC 240 
GCGCTCACCa.^GTCAGCTA CTGGGACACC GGGGTCCTGC TGTGCGCGCT GCTCAGCTGT 300 
CTGCTTCTCA CAGGATCTAG TTCAGGTTCA AAATTAAAAG ATCCTGAACT GAGTTTAAAA 360 
GGCACCCAGC ACATCATGCA AGCAGGCCAG ACACTGCATC TCCAATGCAG GGGGGAAGCA 420 

GCCCATAAAT GGTCTTTGCC TGAAATGGTG AGTAAGGAAA GCGAAAGGCT GAGCATAACT 4 80 

AAATCTGCCT GTGGAAGAAA TGGCAAACAA TTCTGCAGTA CTTTAACCTT GAACACAGCT 540 

CAAGCAAACC ACACTGGCTT CTACAGCTGC AAATATCTAG CTGTACCTAC TTCAAAGAAG 600 

AAGGAAACAG AATCTGCAAT CTATATATTT ATTAGTGATA CAGGTAGACC TTTCGTAGAG 660 

ATGTACAGTG AAATCCCCGA AATTATACAC ATGACTGAAG GAAGGGAGCT CGTCATTCCC 720 

TGCCGGGTTA CGTCACCTAA CATCACTGTT ACTTTAAAAA AGTTTCCACT TGACACTTTG 780 

ATCCCTGATG GAAAACGCAT AATCTGGGAC AGTAGAAAGG GCTTCATCAT ATCAAATGCA 840 

ACGTACAAAG AAATAGGGCT TCTGACCTGT GAAGCAACAG TCAATGGGCA TTTGTATAAG 900 

ACAAACTATC TCACACATCG ACAAACCAAT ACAATCATAG ATGTCCAAAT AAGCACACCA 960 

CGCCCAGTCA AATTACTTAG AGGCCATACT CTTGTCCTCA ATTGTACTGC TACCACTCCC 1020 

TTGAACACGA GAGTTCAAAT GACCTGGAGT TACCCTGATQ AAAAAAATAA GAGAGCTTCC 1080 

GTAAGGCGAC GAATTGACCA AAGCAATTCC CATGCCAACA TATTCTACAG TGTTCTTACT 1140 

ATTGACAAAA TGCAGAACAA A6ACAAAGGA CTTTATACTT GTCX3TGTAAG GAGTGGACCA 1200 

TCATTCAAAT CTGTTAACAC CTCAGTGCAT ATATATGATA AAGCATTCAT CACTGTGAAA 1260 

CATCGAAAAC AGCAGGTGCT TGAAACCGTA GCTGGCJ^AGC GGTCTTACCG GCTCTCTATG 1320 

AAAGTGAAGG CATTTCCCTC GCCGGAAGTT GTATGGTTAA AAGATGGGTT ACCTGCGACT 1380 

GAGAAATCTG CTCGCTATTT GACTCGTGGC TACTCX5TTAA TTATCAAGGA CGTAACTGAA 1440 

GAGGATGCAG GGAATTATAC AATCTTGCTG AGCATAAAAC AGTCAAATGT GTTTAAAAAC 1500 

CTCACTGCCA CTCTAATTGT CAATGTGAAA CCCCAGATTT ACGAAAAGGC CGTGTCATCX3 1560 

TTTCCAGACC CGGCTCTCTA CCCACTGGGC AGCAGACAAA TCCTGACTTG TACCGCATAT 1620 

GGTATCCCTC A ACCTA CAAT CAAGTGGTTC TGGCACCCCT GTAACCATAA TCATTCCGAA 1680 

GCAAGGTGTG ACTTTTGTTC CAATAATGAA GAGTCCTTTA TCCTGGATGC TGACAGCAAC 174 0 

ATGGGAAACA GAATTGAGAG CATCACTCAG CGCATGGCAA TAATAGAAGG AAAGAATAAG 1800 

ATGGCTAGCA CCTTGGTTGT GGCTGACTCT AGAATTTCTG GAATCTACAT TTGCATAGCT 1860 

TCCAATAAAG TTGGGACTGT GGGAAGAAAC ATAAGCTTTT ATATCACAGA TGTGCCAAAT 1920 

GGGTTTCATG TTAACTTGGA AAAAATGCCG ACGGAAGGAG AGGACCTGAA ACTGTCTTGC 1980 

ACAGTTAACA AGTTCTTATA CAGAGACGTT ACTTGGATTT TACTGCGGAC AGTTAATAAC 204 0 

AGAACAATGC ACTACAGTAT TAGCAAGCAA AAAATGGCCA TCACTAAGGA GCACTCCATC 2100 

ACTCTTAATC TTACCATCAT GAATGTTTCC CTGCAAGATT CAGGCACCTA TGCCTGCAGA 2160 

GCCAGGAATG TATACACAGG GGAAGAAATC CTCCAGAAGA AAGAAATTAC AATCAGAGAT 2 220 

CAGGAAGCAC CATACCTCCT GCGAAACCTC AGTGATCACA CAGTGGCCAT CAGCAGTTCC 2280 

ACCACTTTAG ACTGTCATGC TAATGQTGTC CCCGAGCCTC AGATCACTTG GTTTAAAAAC 2340 

AACCACAAAA TACAACAAGA GCCTGGAATT ATTTTAGGAC CAGGAAGCAG CACGCTGTTT 2400 

ATTGAT^GAG TCACAGAAGA GGATGAAGGT GTCTATCACT GCAAAGCCAC CAACCAGAAG 2460 

GGCTCTGTGG AAAGTTCAGC ATACCTCACT GTTCAAGGAA CCTC6GACAA GTCTAATCTG 2520 

GAGCTGATCA CTCTAACATG CACCTGTGTG GCTGCGACTC TCTTCTGGCT CCTATTAACC 2580 

CTCCTTATCC GAAAAATGAA AAGGTCTTCT TCTGAAATAA AGACTGACTA CCTATCAATT 2640 

ATAATGGACC CAGATGAAGT TCCTTTGGAT GAGCAGTGTG AGCGGCTCCC TTATGATGCC 2700 

AGCAAGTGGG AGTTTGCCCG GGAGAGACTT AAACTGGGCA AATCACTTGG AAGAGGGGCT 2760 

TTTGGAAAAG TGGTTCAAGC ATCAGCATTT GGCATTAAGA AATCACCTAC GTGCCGGACT 2 820 

GTGGCTGTGA AAATGCTGAA AGAGGGGGCC ACGGCCAGCG AGTACAAAGC TCTGATGACT 2 880 

GAGCTAAAAA TCTTGACCCA CATTGGCCAC CATCTGAACG TGGTTAACCT GCTGGGAGCC 2 940 

TGCACCAAGC AAGGAGGGCC TCTGATGGTG ATTGTTGAAT ACTGCAAATA TGGAAATCTC 3 000 

TCCAACTACC TCAAGAGCAA ACGTGACTTA TTTTTTCTCA ACAAGGATGC AGCACTACAC 3060 

ATGGAGCCTA AGAAAGAAAA AATGGAGCCA GGCCTGGAAC AAGGCAAGAA ACCAAGACTA 3120 

GATAGCGTCA CCAGCAGCGA AAGCTTTGCG AGCTCCGGCT TTCAGGAAGA TAAAAGTCTG 3180 

AGTGATGTTG AGGAAGAGGA GGATTCTGAC GGTTTCTACA AGGAGCCCAT CACTATGGAA 324 0 

GATCTGATTT CTTACAGTTT TCAAGTGGCC AGAGGCATGG AGTTCCTGTC TTCCAGAAAG 3300 

TGCATTCATC GGGACCTGGC AGCGAGAAAC ATTCTTTTAT CTGAGAACAA CGTGGTGAA6 3360 

ATTTGTGATT TTGGCCTTGC CCGGGATATT TATAAGAACC CC6ATTATGT GAGAAAAGGA 3420 

GATACTCGAC TTCCTCTGAA ATGGATGGCT CCCGAATCTA TCTTTGACAA AATCTACAGC 3480 

ACCAAGAGCG ACGTGTGGTC TTACGGAGTA TTGCTGTGGG AAATCTTCTC CTTAGGTGGG 3540 

TCTCCATACC CAGGAGTACA AATGGATGAG GACTTTTGCA GTCGCCTGA6 GGAASBGCATG 3600 

AGGATGAGAG CTCCTGAGTA CTCTACTCCT GAAATCTATC AGATCATGCT GGACTGCTGG 3660 

CACAGAGACC CAAAAGAAAG GCCAAGATTT GCAGAACTTG TGGAAAAACT AGGTGATTTG 3720 

CTTCAAGCAA ATGTACAACA GGATGGTAAA GACTACATCC CAATCAATGC CATACTGACA 3780 

G GAA ATAGTG GGTTTACATA CTCAACTCCT GCCTTCTCTG AGGACTTCTT CAAGGAAAGT 384 0 

ATTTCAGCTC CGAAGTTTAA TTCAGGAAGC TCTGATGATG TCAGATATGT AAATGCTTTC 3900 

AAGTTCATGA GCCTGGAAAG AATCAAAACC TTTGAAGAAC TTTTACCGAA TGCCACCTCC 3 960 

ATGTTTGATG ACTACCAGGG CGACAGCAGC ACTCTGTTGG CCTCTCCCAT GCTGAAGCGC 4020 

TTCACCTGGA CTGACAGCAA ACCCAAGGCC TCGCTCAAGA TTGACTTGAG AGTAACCAGT 4 080 

AAAAGTAAGG AGTCX3GGGCT GTCTGATGTC AGCAGGCCCA GTTTCTGCCA TTCCAGCTGT 414 0 

GGGCACGTCA GCGAAGGCAA GCGCAGGTTC ACCTACGACC ACGCTGAGCT GGAAAGGAAA 4 200 

ATCGCGTGCT GCTCCCCGCC CCCA6ACTAC AACTCGGTGG TCCTGTACTC CACCCCACCC 4260 

ATCTAGAGTT TGACACGAAG CCTTATTTCT AGAAGCACAT GTGTATTTAT ACCCCCAGGA 4 320 

AACTAGCTTT TGCCAGTATT ATGCATATAT AAGTTTACAC CTTTATCTTT CCATGGGAGC 4380 

CAGCTGCTTT TTGTGATTTT TTTAATAGTG CTTTTTTTTT TTGACTAACA AGAATGTAAC 4 44 0 

TCCA6ATAGA GAAATAGTGA CAAGTGAAGA ACACTACTGC TAAATCCTCA TGTTACTCAG 4 500 

TGTTAGAGAA ATCCTTCCTA AACCCAATGA CTTCCCTGCT CCAACCCCCG CCACCTCAGG 4 560 

GCACGCAGGA CCAGTTTGAT TGAGGAGCTG CACTGATCAC CCAATGCATC ACGTACCCCA 4620 

CTGGGCCAGC CCTGCA6CCC AAAACCCAGG GCAACAAGCC CQTTAGCCCC AGGGGATCAC 4680 
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TGGCTGGCCT GA6CAACATC TCGGGA6TCC TCTAGOVGGC CTJJVGACATG TGAGGAGG]^ 4740 

AAGGAAAAAA AGCAAAAAGC AAGGGAGAAA AGA6AAACCG 6GAGAAGGCA TGA6AAA6AA 4800 

TTTGAGACGC ACCATGTGGG CACGGAGGGG GACGGGGCTC AGCAATGCCA TTTCAGTGGC 4 860 

TTCCCAGCTC TGACCCTTCT ACATTTGAGG GCCCAGCCAG GAGCAGATGG ACAGCGATGA 4 920 

5 GGGGACATTT TCTGGATTCT GGGAGGCAAG AAAAGGACAA ATATCTTTTT TGGAACTAAA 4 980 

GCAAATTTTA GACCTTTACC TATGGAAGTG GTTCTATGTC CATTCTCATT CGTGGCATGT 504 0 

TTTGATTTGT AGCACTGAGG GTGGCACTCA ACTCTGAGCC CATACTTTTG GCTCCTCTAG 5100 

TAAGATGCAC TGAAAACTTA GCCAGAGTTA GGTTGTCTCC AGGCCATGAT GGCCTTACAC 5160 

TGAAAATGTC ACATTCTATT TTGGGTATTA ATATATAGTC CAGACACTTA ACTCAATTTC 5220 

10 TTGGTATTAT TCTGTTTTGC ACAGTTAGTT GTGAAAGAAA GCTGAGAAGA ATGAAAATGC 528 0 

AGTCCTGAGG AGAGTTTTCT CCATATCAAA ACGAGGGCTG ATGGAGGAAA AAGGTCAATA 534 0 

AGGTCAAGGG AAGACCCCGT CTCTATACCA ACCAAACCAA TTCACCAACA CAGTTGGGAC 54 0 0 

CCAAAACACA GGAAGTCAGT CACGTTTCCT TTTCATTTAA TGGGGATTCC ACTATCTCAC 5460 

ACTAATCTGA AAGGATGTGG AAGAGCATTA GCTGGCGCAT ATTAAGCACT TTAAGCTCCT 5520 

15 TGAGTAAAAA GGTGGTATGT AATTTATGCA AGGTATTTCT CCAGTTGGGA CTCAGGATAT 5580 

TAGTTAATGA GCCATCACTA GAAGTUUU^GC CCATTTTCAA CTGCTTTGAA ACTTGCCTGG 5640 

GGTCTGAGCA TGATGGGAAT AGGGAGACAG GGTAGGAAAG GGCGCCTACT CTTCAGGGTC 5700 

TAAAGATCAA GTGGGCCTTG GATCX^TTAAG CTGGCTCTGT TTGATGCTAT TTATGCAA6T 5760 

TAGGGTCTAT GTATTTAGGA TGCGCCTACT CTTCAGGGTC TAAAGATCAA GTGGGCCTTG 5820 

20 GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTAT6CAAGT TAGGGTCTAT GTATTTAGGA 5880 

TGTCTGCACC TTCTGCAGCC AGTCAGAAGC TGGAGAGGCA ACAGTGGATT GCTGCTTCTT 594 0 

GGGGAGAAGA GTATGCTTCC TTTTATCCAT GTAATTTAAC TGTAGAACCT GAGCTCTAAG 6000 

TAACCGAAGA ATGTATGCCT CTGTTCTTAT GTGCCACATC CTTGTTTAAA GGCTCTCTGT 6060 

ATGAAGAGAT GGGACCGTCA TCAGCACATT CCCTAGTGAG CCTACTGGCT CCTGGCAGCG 612 0 

25 GCTTTTGTGG AAGACTCACT AGCCAGAAGA GAGGAGTGGG ACAGTCCTCT CCACCAAGAT 6180 

CTAAATCCAA ACAAAAGCAG GCTAGAGCCA GAAGAGAGGA CAAATCTTTG TTGTTCCTCT 624 0 

TCTTTACACA TACGCAAACC ACCTGTGACA GCTGGCAATT TTATAAATCA GGTAACTGGA 6300 

AGGAGGTTAA ACTCAGAAAA AAGAAGACCT CAGTCAATTC TCTACTTTTT tTTTTTTTTT 6360 

TCCAAATCAG ATAATAGCCC AGCAAATAGT GATAACAAAT AAAACCTTAG CTGTTCATGT 6420 

30 CTTGATTTCA ATAATTAATT CTTAATCATT AAGAGACCAT AATAAATACT CCTTTTCAAG 6480 

AGAAAAGCAA AACCATTAGA ATTGTTACTC AGCTCCTTCA AACTCAGGTT TGTAGCATAC 654 0 

ATGAGTCCAT CCATCAGTCA AAGAATGGTT CCATCTGGAG TCTTAATGTA GAAAGAAAAA 6600 

TGGAGACTTG TAATAATGAG CTAGTTACAA AGTGCTTGTT CATTAAAATA GCACTGAAAA 6660 

TTGAAACATG AATTTU^CTGA TAATATTCCA ATCATTTGCC ATTTATGACA AAAATGGTTG 6720 

35 GCACTAACAA AGAAC6A6CA CTTCCTTTCA GAGTTTCTGA GATAATGTAC GTGGAACAGT 6780 

CTGGGTGGAA TGGGGCTGAA ACCATGTGCA AGTCTGTGTC TTGTCAGTCC AAGAAGTGAC 6840 

ACCGAGATGT TAATTTTAGG GACCCGTGCC TTGTTTCCTA GCCCACAAGA ATGCAAACAT 6900 

CAAACAGATA CTCGCTAGCC TCATTTAAAT TGATTAAAGG AGGAGTGCAT CTTTGGCCGA 6960 

CAGTGGTGTA ACTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGGGTGTG 7020 

40 GGTGTATGTG TGTTTTGTGC ATAACTATTT AAGGAAACTG GAATTTTAAA GTTACTTTTA 7080 

TACAAACCAA GAATATATGC TACAGATATA AGACAGACAT GGTTTGGTCC TATATTTCTA 7140 

GTCATGATGA ATGTATTTTG TATACCATCT TCATATAATA TACTTAAAAA TATTTCTTAA 7200 

TTGGGATTTG TAATCGTACC AACTTAATTG ATAAACTTGG CAACTGCTTT TATGTTCTGT 7260 

CTCCTTCCAT AAATTTTTCA AT^TACTAAT TCAACAAAGA AAAAGCTCTT TTTTTTCCTA 7320 

45 AAATAAACTC AAATTTATCC TTGTTTAGAG CAGAGAAAAA TTAAGAAAAA CTTTGAAATG 7380 

GTCTCAAAAA ATTGCTAAAT ATTTTCAATG GAAAACTAAA TGTTAGTTTA GCTGATTGTA 7440 

TGGGGTTTTC GAACCTTTCA CTTTTTGTTT GTTTTACCTA TTTCACAACT GTGTAAATTG 7500 

CCAATAATTC CTGTCCATGA AAATGCAAAT TATCCAGTGT AGATATATTT GACCATCACC 7560 

CTATGGATAT TGGCTAGTTT TGCCTTTATT AAGCAAATTC ATTTCAGCCT GAATGTCTGC 7620 

50 CTATATATTC TCTGCTCTTT GTATTCTCCT TTGAACCCGT TAAAACATCC TGTGGCACTC 

Seq ID NO: 215 Protein sequence i 
Protein Accession #: NP_002010.1 

55 

1 11 21 31 41 51 

I I i I I I 

MVSYWDTGVL LCALLSCLLL TGSSSGSKLK DPELSLKGTQ HIMQAGQTLH LQCRGEAAHK 60 

WSLPEMVSKE SERLSITKSA CGRNGKQFCS TLTLNTAQAN HTGFYSCKYL AVPTSKKKET 120 

60 ESAIYIFISD TGRPFVEMYS EIPEIIiOfrB GRELVIPCRV TSPNITVTIiK KFPU>TIjIPD 180 

GKRIIWDSRK GFIISNATYK EIGLLTCEAT VNGHLYKTNY LTHRQTNTII DVQISTPRPV 240 

ICLLRGHTIjVIi NCTATTPLNT RVQMTWSYPD EKNKRASVRR RIDQSNSHAN IFYSVLTIDK 300 

MQNKDKGLYT CRVRSGPSFK SVNTSVHIYD KAFITVKHRK QQVLETVAGK RSYRLSMKVK 360 

AFPSPEWWIi FCDGLPATEKS ARYLTRGYSL IIKDVTEEDA GNYTIIiI*SIK QSNVFKNIjTA 420 

65 TLIVNVKPQI YBKAVSSFPD PALYPLGSRQ ILTCTAYGIP QPTIKWPWHP CNHNHSBARC 480 

DFCSNNEESF ILDADSNMGN RIESITQRMA XXEGKNKMAS TLWADSRIS GIYICIASNK 540 

VGTVGRNISP YITDVPNGFH VNliBKMPTEG EDIjKIjSCTVN KFLYRDVTMI LURTVNNRTM 600 

HYSISKQKMA ITKEHSITLM LTZHHVSLOD SGTYACRARN VYTGEEILQK KEITIRDQEA 660 

PYIiliRNIiSDH TVAXSSSTTL DCHANGVPEP QITWFKNmiK IC2QEPGIIIiG PGSSTLFIBR 720 

70 VTEEDB6VYH CKATNQKGSV ESSAYLTVQG TSDKSNLBIiI TIiTCTCVAAT IiFWLLLTLLI 780 

RKMKRSSSEI KTDYIiSIZMD PDEVPLDEQC BRIiPYDASKW EFARERLfOiG KSLGRGAFGK 840 

WQASAFGIK KSPTCRTVAV KMLKBGATAS EYKALMTEIiK ILTHIGHHLN WNLIiGACTK 900 

QGGPIiMVIVE YCKYGNLSNY LKSKRDLFFL NKDAALHMEP KKEKMEPGLE QGKKPRLDSV 960 

TSSESFASSG FQEDKSLSDV EEEBDSDGFY KEPITMEDLI SYSFQVARGM EFliSSRKCIH 1020 

75 RDIiAARNILXi SENNWKICD FGLARDIYKN PDYVRKGDTR LPLKWMAPES IFDKIYSTKS 1080 

DVWSYGVIiLW EIFSLGGSPY PGVQMDEDFC SRLREGMRMR APBYSTPBIY QIMLDCWHRD 1140 
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PKERPRFAEL VEKLGDLLQA NVQQDGKDYI PINAILTGNS GFTYSTPAFS EDFFKESISA 1200 

PKFNSGSSDD VRYVNAFKFM SLERIKTFEE LLPNATSMFD DYQGDSSTIiL ASPMIjKRFTW 1260 

TDSKPKASLK IDLRVTSKSK ESGLSDVSRP SFCHSSCGHV SEGKRRFTYD HAELERKIAC 1320 
CSPPPDYNSV VLYSTPPI 



Seq ID NO: 216 DMA geou^nce 

Nucleic Acid Accession NM_0246e9 

Coding sequence: 76-624 (underlined sequences correspond to start and stop codons) 
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1 
I 

CTCTTTGGCC 
AGCCCAGAGC 
TGGCTGGCCC 
TCCCTTTCTT 
GGTCTTGATA 
GAAATAAGAT 
TCTTATCCTG 
CTGGTCAGCA 
CCJ^AAGACCT 
CAGGCCAAGC 
AAGTTCCTGT 
AGAACTTCCA 
AGTAATCATT 
AGAATTCACC 
GACAACTCCTl 
TGAGCCCACC 
CTGGACAGGT 
TTACCAAAGG 
CCCTAGTGAT 
CTTTTACTTT 
TTTATGTTTC 
TTTTTTGAGC 
GGGGAAGCTA 
TAGATTTCGG 
GTGTATCAGG 
AGATTTATTT 
TGGTAGGCCT 
ATTCCTTGTT 
GCCCACCCAG 
TCACATCTAT 
GAATACAGCC 
TTTTATCGAC 
AGCATCACCC 
CAATA6AGAC 
AGGATGAGGA 
TTTTTATTTT 
AAGTGATTTA 
ATGAAACCTT 
TAATTGTGGG 
TGTAA6TGGG 
GTCTCTTAAG 
TAGCAGTTAT 



11 
I 

AAGCCCTGCC 
CCAA GATGG A 
TGCTGCTGTG 
CTCTGGTGCC 
AATGCAATGC 
CTGACAACTG 
CAAATTACTC 
AATATCAAAA 
GCAGCATTGA 
GCCTCACGCC 
GTATGCTGAG 
GAAAGTGTTA 
AAAGAGGCTT 
AACACACAGG 
AAGCCCCGGC 
CCAATCCAGA 
CTTCCCTATG 
CCCACATAAC 
GGATGAACTC 
TTAAGTACCT 
CATTCTGGTA 
AAACACTCGG 
CTTCTCCTCT 
TCTTCATTGC 
CAGGGTTCTA 
CAAAGAATTG 
GCAATCTGTA 
CCTTA7UUUVA 
ATTACCTAGA 
GAAATGCCTT 
TAGCCAAGTT 
CGTCTTCAGA 
TGAACCAAAG 
ATTGACTGGT 
AACCAGGCAC 
TCACTGGGAG 
GAAACTCCAA 
ATTTTATTGG 
TTTGCACATG 
ACCTTGGGGA 
CCTGTTCCTG 
CTATTGTTGT 



21 



TCTGTACAGC 
GCCCCAGCTG 
GGTCTCAGCC 
CCAAGTCAGA 
CTGCATCGGG 
GCTGGCTTCC 
AGATGATTCC 
CGAGATCTCA 
GCGTGTCCTG 
GGACCTGGTG 
ATAA CACCAG 
GCCTTCTCCC 
CTGCATCAAA 
CCCACCAGCA 
TCTTTCCACC 
TGTGATCCCC 
AGATAGAACC 
TTCTAAATTT 
TCTTATCTCT 
CCATCAGA6T 
AGAACTCTTT 
GGGTATGGAT 
ATTCAGATTT 
TGTCCATTTT 
CCAGAGAAAC 
ATTTACATGA 
AACCTTTGGG 
ATCTGTTTTT 
TAATCTCTTT 
CACAGCAACA 
GACACATAAA 
CTGTTAAGGA 
GCCCCTATCA 
TGGCTGGCTT 
GGGA6AGGGA 
GTGGTAAGTT 
AGCAATT6GT 
AAATGGTTGG 
GCCAGCACAT 
Q6AGCTGCCT 
CTACAGTTAT 
GTATTAAACC 



31 
I 

CTCGAGTGGA 
GGGCCTGAGG 
CTGAGCT6TT 
ACCAGCTACA 
ACATCTATTT 
CACCTTGGAC 
AAAATCTGGC 
GACAGGAAAA 
CGGAAAACAG 
CAGGACTGTC 
TGAAAAAGCC 
AACTGTGTTA 
CCTTCACATG 
ACAGGCTACC 
ACACTGTGGT 
CT6TGATCTA 
TGATAAGGAG 
TGGTCTGGTC 
GGCTTCTAGA 
CAT6AAATCA 
AAATGAGGAC 
6AAA6CCAAT 
CACTAAAATC 
TGTAATGAAC 
AGAACCAGTA 
TTGTGGGGAT 
CAGGAGCTGA 
GTTCTTAAGG 
TACTTAAAGT 
CCTAGATTAG 
ATTAACCATC 
TTGTGGTAGA 
AGTAACAATA 
CCCAAGGGAT 
GGGGCAACAG 
AGCCCTGTT6 
AATCCCCAAA 
TTTCCCAATT 
GCCAAACAGA 
CCATCATAAA 
AGAGGTTGCT 
ATTTCAACAC 



41 
I 

CAGCCAGAG6 
CTGCCGCCCT 
CTTTCTCCTT 
ATTTTGGAAG 
GCAAGAAGTT 
TGCCTCCCGA 
GCCCTGTGGA 
TCTGTGCCTC 
AGAGGTTCCA 
ACCAGGGCCA 
TGGCATGGAG 
TACCAACCAC 
CAGCTCCCAT 
TTTGCACAAT 
CCCCTAGATG 
CTTCTGGCAA 
CTAGGGCAAT 
TGAAGGAAAA 
GGGAAAAAAA 
CCTGTCAAGA 
ACTGCTGATT 
CGCAGGTCAA 
TTCCAAGATG 
GAGTGTTTTT 
GGAGATACAT 
TGGCAAGTCG 
TGCTGTAGTT 
GCTTTGAATG 
AAACTGATTG 
CATTCAATTG 
ACAGCAACAT 
GAACTGTGAC 
TAGCCAAGCA 
AGCACCAGAC 
AGGTCCAGGG 
CCCATGTATG 
AT6GQTGTAT 
CTGTTTGCCA 
AGTAGACAAA 
GGGAGGGGTT 
CAGAACCTTC 
AT 



51 
I 

CTGCAGCTGG 
CCGCCCTGGC 
GCCAGCTTCT 
GACTTTCCTC 
CTTTAAAGAA 
TTCCTTGCTT 
GATCTTTAGA 
TGCATCAGCC 
GAAATGGCTG 
GAGAGAACTA 
CCCAGCACTG 
ATTTTCAAAT 
GCCACCCTCC 
ATTCTCTGAT 
GGGCTGTTGC 
GATTCTCAGT 
TCTGACAACA 
CCTGTTCTCG 
AAGCATACCT 
CTATCTATCT 
GCTG6TGATG 
ATGACTCCTT 
AAAGCAAATC 
CCTTTAGCTA 
ATACATGTCC 
AAAATCCATA 
TGCAGATAGA 
ATTGGATCAG 
TAGGTGCTAA 
AATAACTGGG 
GCCTGCTAAA 
AGCCACTCTC 
AAATTCCAGT 
AAGAAATGCA 
TTTGGTTATC 
CAGATGGGAG 
CTGGTTTGAA 
TTGGCCAATA 
GGTCTCACTC 
AGTAAAAATG 
TCAGCAAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 



Seq ID NO: 217 Protein aemience: 
Protein Accession #: NP 078965.1 



51 



60 



65 



70 



75 



1 11 21 31 41 

I I I r I , 

MEPQIX3PEAA AI^PGWlAJbL LWVSALSCSF SLPASSLSSL VPQVRTSYNP GRTFLGLDKC 60 
NACIGTSICK KPPKEEIRSD NWLASHIiGLP PDSLLSYPAN YSDDSKIWRP VEIFRLVSKY 120 
QNEISDRKIC ASASAPKTCS lERVLRKTBR FQKWLQAKRL TPDLVQDCHQ GQRELKFLCM 180 
IiR 

Seq ID NO: 218 DNA aeouenee 

Nucleic Acid Accession #• AF075027.1 

Coding sequence: 3-269 (underlined sequences correspond to start and stop codons) 



1 
I 

GA TTAA TTAA 
TGGGATGGCT 
CTTTCTGTGC 
CTTTCTCCCT 
CCATAGATCT 
GACAAAA6TG 



11 
I 

GTGCTTTAAA 
GTAGCTTGAG 
CAAGGTGCTG 
GGAGTAA6GC 
CATGCATCAT 
GTGGGTTTTC 



21 



31 



CGGTCTTGGT AAATATTCCG 
TTGAATTTTA ACTGTCCTCA 
TGTTACGGGA GAGAGTGACT 
CGAAGACTGQ ATTACTACAC 
TAATGCCA TA Tg ACATTtsnr 
ATTGTCTTCA CTGATTGTCA 



41 
I 

CGGGA6CTG6 
TTCTGGGTTT 
GGAAAGTAAC 
GCCTAGACGT 
ATTTTCTTTC 
ATGCATTAAT 



51 
I 

GGAGGACCGT 60 

TGTCGCTCTG 120 

AAA6CTGAAT ISO 

GACACTACAC 240 

TCAGTTCACQ 300 

AAA6AAGATG 360 



282 



NSOOCIO: <WO 02079492A2J.> 



wo 02/079492 



PCT/US02/04915 



TGTGGT 

Seq ID NO; 219 Protein secruence ; 
^ Protein Accession #: AF075027 

1 11 21 31 41 51 

I I I I I I 

ERKWQCHMAL MMHEIYGCSV TSRRWIQSS ALLQGERFSF VTPQSLSPVT QHLGTESRAT 60 
KPRMRTVKIQ LKLQPSQRSS PAPAEYLPRP FKALN 

Seq ID NO: 220 DNA sequence 

Nucleic Acid Accession #: AL133411.8 

Coding sequence: 1-1395 (underlined sequences correspond to start and stop codons) 

15 1 11 21 31 41 51 

I I I t I I 

ATG GGCAAGG ACTTCATGAC TAAAACACTA AAAGCAATGG CAACAAAAGC CAAAATTGAC 60 

AAATGGGATC TAATCAAATT AAAGAGCTTC C6CACAGCAA AAGAAACTAT TATCAGAGT6 120 

AACAGGCAAC CTACAGAATG GGAGAAAAAT TTTGCAATGT ATCCATCTGA CAAAGGGCTG 180 

20 ACATCCAGAA' TCTATAAG6A ACTTAAACAA TTTTACAAGA AAAAACCAAA CAACGCCATC 240 

AAAAAGGACA TGGATGAAGC TGGAAACCXST CATTCTCAGA AAACTAACAC AGGAACAGAA 300 

AACCAAACAC CACATGTTCT CACTCATAAG TGGGAGTTGA ACAATGAGAA CACATGGACA 360 

CAGGGAGGGG AACATCACAC ACTGGGGCCT GTCAGAAGCC CCTCTGGCCT CCTGGCTGGC 420 

CTTGAACATG CTGGGAGGAA ATTACAATTC. ATCCATGGGC TGTTTACCCT TGAAAATGAA 4 80 

25 TGGGCCCAGG AACAATCCAT AATACAAAAG AAATATGCAT TATGGATTGG AACCAAGCAG 540 

ATCTGGGTGG CACAAACTCC TGGTGAATCT ATCTCCAGTT CACCAGCATT GCCTAATGTG 600 

CTACCTTTAA ATGT^GATGT TAATAAGCAG GAAGAAAAGA ATGAAGATCA TACTCCCAAT 660 

TATGCTCCTG CTAATGAGAA AAATGGCAAT TATTATAAAG ATATAAAACA ATATGTGTTC 720 

ACAACACAAA ATCCAAATGG CACTGAGTCT GAAATATCTG TGAGAGCCAC AACTGACCTG 780 

30 AATTTTGCTC TAAAAAACGA TAAAACTGTC AATGCAACTA CATATGAAAA ATCCACCATT 840 

GAAGAAGAAA CAACTACTAG CGAACCCTCT CATAAAAATA TTCAAAGATC AACCCCAAAC 900 

GTGCCTGCAT TTTGGACAAT GTTAGCTAAA GCTATAAATG GAACAGCAGT GGTCATGGAT 960 

GATAAAGATC AATTATTTCA CCCAATTCCA GA6TCTGATG TGAATGCTAC ACAGGGAGAA 1020 

AATCAGCCAG ATCTAGAGGA TCTGAAGATC AAAATAATGC TGGGAATCTC GTTGAT6ACC 1080 

35 CTCCTCCTCT TTGTGGTCCT CTTGGCATTC T6TAGTGCTA CACTGTACAA ACTGAGGCAT 1140 

CTGAGTTATA AAAGTTGTGA GAGTCAGTAC TCTGTCAACC CAGAGCTGGC CACGATGTCT 1200 

TACTTTCATC CATCAGAAGG TGTTTCAGAT ACATCCTTTT CCAAGAGTGC AGAGAGCAGC 1260 

ACATTTTTGG GTACCACTTC TTCAGATATG AGAAGATCAG GCACAAGAAC ATCAGAATCT 1320 

AAGATAATGA CGGATATCAT TTCCATAGGC TCAGATAATG AGATGCATGA AAACGATGAG 1380 

40 TCGGTTACCC G GTGA 



45 



Seq ID NO: 221 Protein sequencer 
Protein Accession #: AIil33411.8 



50 



55 



MGKDFMTKTL 
TSRIYKELKQ 
QGGEHHTLGP 
IWVAQTPGES 
TTQNPNGTES 
VPAFWTMLAK 
LIiLFWIiLAF 
TFIiGTTSSDM 



11 
I 

KAMATKAKID 
FYKKKPNNAI 
VRSPSGLLA6 
ISSSPALPNV 
EISVRATTDIi 
AINGTAWMD 
CSATLYKXiRH 
RRSGTRTSES 



21 
I 

KWDlilKLKSF 
ECKDMDEAGNR 
LEHAGRKLQP 
LPIiNEDVNKQ 
NFALKNDKTV 
DKDQIiFHPIP 
LSYKSCESQY 
KIMTDIISIG 



31 

I 

RTAKETIIRV 
HSQKTNT6TE 
IK6LFTLENE 
EEKNEDHTPN 
NATTYEKSTI 
ESDVNATQ6E 
SVNPEIiATMS 
SDMEMHENDE 



41 
I 

NRQPTEWEKN 
NQTPHVIiTHK 
WAQEQSIIQK 
YAPANEKNGN 
EEETTTSEPS 
NQPDIiSDLKI 
YFMPSEGVSD 
SVTR 



51 

I 

FAMYPSDKGL 
HEimiENTWT 
KYAIjWIGTKQ 
YYKDIKQYVF 
HKNIQRSTPN 
KIMLGISIiMT 
TSFSKSAESS 



60 
120 
180 
240 
300 
360 
420 



60 



Seq ID NO: 222 DNA sequence 

Nucleic Acid Accession #: AL050295.1 

Coding sequence: 237-2073 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

65 GAAGGGGACA GAAGGCAGTT CACCTCTGCT CCCGACAGCC TGGGAACCCG CAAGAGCCCC 60 

AGCATTTGAA GTCTGGTCTT GTGAAACCCC ACCCTCCTCT GGCTGTGTGA TTGAATGGGA 120 

TGCCCTCGAG GTACACCTCA CCTGAGAGGG TTTTGGGCAG ATCAGCAGTA AGGTGTTAAA 180 

TTTTAGAAGC CTGAAAACTC CAGAAGAGAA AGGCCAACCA ACTCAAACTT (3AAGA CATGA 240 

AATCCCCAAG GAGAACCACT TTGTGCCTCA TGTTTATTGT GATTTATTCT TCCAAAGCTG 300 

70 CACTGAACTG GAATTACGAG TCTACTATTC ATCCTTTGAG TCTTCATGAA CATGAACCAG 360 

CTGGTGAAGA GGCACTGAGG CAAAAAC6AG CCGTTGCCAC AAAAAGTCCT AC(36CTGAAG 420 

AATACACTGT TAATATTGAG ATCAGTTTTG AAAATGCATC CTTCCTGGAT CCTATCAAAG 480 

CCTACTTGAA CAGCCTCAGT TTTCCAATTC ATGGGAATAA CACTGACCAA ATTACTGACA 540 

TTTTGAGCAT AAATGTGACA ACAGTCTGCa^ GACCTGCTGG AAATGAAATC TGGTGCTCCT 600 

75 GCGAGACAGG TTATGtSGTGG CCTCGGGAAA GGTCTCTTCA CSATCTCATT TGTCAAGAGC 660 

GTGAGGTCTT CCTCCCAGG6 C31CCATTGCA GTTGCCTTAA AGAACTGCCT CCCAATGGAC 720 



283 



wo «»2/«794«)2 



PCT/US«2/049I5 



10 



15 



20 



25 



CTTTTTGCCT 
TTCAAGAAGA 
AAACAGCGTT 
GGTTCAAGTC 
A6TTAATACA 
ACTACAACTC 
TCATCTTTGA 
TGTCTTGGCG 
ACACCGCACT 
CAGGTGATGC 
AGAAGAAAAT 
GCGACAACAA 
TAGAAT6GAA 
CTAGCTGCAG 
GAACAACAGT 
ACATAAAAGT 
ITTCTGAGGG 
TTTATTGGAA 
ATCTTGATGG 
CCTATCACTG 
TTCACCCGCT 
GCAGTGGTTC 
TCCATATGGG 



GCTTCAGGAA 
CCTCATGAAC 
CCGGAAGGGT 
TGGAAGTGTG 
TAAAGCCAAT 
CTTTCAAGCA 
AGGGGACACA 
CTAT6AA6AA 
TTTCAACAAC 
AGGTGAATAT 
AGAT6TTATG 
TCCTGTATCT 
GCAGGAAGGA 
CAGATACACC 
CATCTACACT 
GACATTCATC 
ACAAAACTTT 
CACTTCTGCT 
AGCAGAATCA 
CATATTTAGA 
GCCTCTAAAG 
CCATCACATC 
TTCCTCATCC 



GATGTTACCC 
ACTTCCTCCG 
TACGGAATTT 
GTTGTGACAT 
GAACAAGTTG 
GTTACTATCA 
GTCAGTCTGG 
CAGCAGTTGG 
ATGACTTCGG 
GTTTGCAAAC 
CCCATCCAAA 
TTGAACTGCT 
AAAATAAATA 
CTCAAGGCTG 
TGTGAGTTCA 
TCTGTG6CCA 
TCTATAAAAT 
GGAATTAAAA 
GTACTGACAG 
TATAAGAATT 
CTGAACATCA 
AAGTGCTGCA 
CTTCCTGCTG 



TGAACATGAG 
CCCTCTATAG 
TACCAGGCTT 
ATGAAGTCAA 
TACAGAGCCT 
ATGAAAGCAA 
TGTGTGAAAA 
AAATCCAGAA 
TGTCCAAGCT 
TGATATTAGA 
TTTTGGCAAA 
6CAGTCAGGG 
TTCCAG6AAC 
ATGGAACCCA 
TCAGTGCCTA 
ATCTAACAAT 
GCATCAGTGA 
TATACCAAAG 
TCAAGACCTC 
CATACAGTAT 
TGATTGATCC 
TAGAGGAGGA 
TAAA AAAAAA 



AGTCAGACTA 
GTCCTACAAG 
CAAGGGCGTG 
GACTACACCA 
CAATCAGACC 
TTTCTTTGTC 
GGAAGTTTTG 
CAGCAGCAGA 
CACCATCCAC 
CATTTTTGAA 
TGAAGAAATG 
TAATGTTAAT 
CCCTGAGACA 
6TGCCCAA6C 
TGGAGCCAGA 
AACCCCGGAC 
TGT6AGTAAC 
ATTTTATACC 
GACCAGGGAG 
TGCAACCAAA 
TTTGGAAGCT 
TGGAGACTAC 
AAAAAAAAAA 



AATGTAGGCT 
ACCGACTTGG 
ACTGTGACAG 
CCATCACTTG 
TACAAAATGG 
ACACCAGAAA 
TCCTCCAATG 
TTCTCGATTT 
AACATCACTC 
TATGAGTGCA 
AAGGTGATGT 
TGGAGCAAAG 
GACATAGATT 
G66TCGTCTG 
GGCAGTGCAA 
CCAATTTCTG 
TATGAT6AGG 
ACGAGGAGGT 
T6GAATGGAA 
GACGTCATTG 
ACTGTTTCAT 
AAAGTTACTT 
A 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO; 223 Protein sequencer 
Protein Accession CAB43394.1 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



1 
I 

MKSPRRTTLC 
EEYTVNIEIS 
SCETGYGWPR 
GFQEDLMNTS 
LELIHKANEQ 
NVSWRYEEQQ 
CKKKIDVMPI 
DSSCSRYTLK 
SVSEGQNFSI 
GTYHCIPRYK 
TFHMGSSSLP 



11 
I 

LMFIVIYSSK 
FENASFLDPI 
ERCIiHNLICQ 
SALYRSYKTD 
WQSLNQTYK 
IiEIQHSSRFS 
QIZiANEEMKV 
ADGTQCPSGS 
KCISDVSNYD 
NSYSIATKDV 
AVKKKKKK 



21 
I 

AALNWNYBST 
KAYIiNSLSFP 
BRDVFLPGHH 
IiETAFRKGYG 
MDYNSFQAVT 
lYTAIiPNNMT 
MCDNNPVSW 
SGTTVIYTCE 
EVYWNTSAGI 
IVHPLPIiKIiH 



31 

I 

IHPLSIiHEHE 
IH6NNTDQ1T 
CSCLKELPPN 
ILPGFKGVTV 
INESNFFVTP 
SVSKLTIHNI 
CCSQGNVNWS 
FISAYGARGS 
KIYQRFYTTR 
IMIDPLEATV 



41 

I 

PAGEEALRQK 
DILSINVTTV 
GPFCLLQEDV 
TGFKSGSVW 
EIIPE6DTVS 
TPGDAGEYVC 
KVEWKQEGKI 
ANIKVTFISV 
RYIiDGAESVL 
SCS6SHHZKC 



51 

I 

RAVATKSPTA 
CRPAGNEIWC 
TLNMRVRLNV 
TYEVKTTPPS 
LVCEKEVIjSS 
KLIIjDIFBYE 
NIPGTPETDI 
ANLTITPDPI 
TVKTSTREWN 
CIBEtXSOYKV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 224 DNA sequence 

Nucleic Acid Accession #: NM_007268 

Coding sequence: 46-1245 (underlined sequences correspond to start and stop codons) 



1 

I 

GGTAGCAG6A 

GGCCTGCTAC 

CCAGAGAGTG 

CTGCAAGGCT 

ACCATCTTTC 

CTGCATGTGA 

GATGACCGGA 

GTGAGAGATA 

ACAACTGGCA 

CAGGCTCGGG 

GAACCCATCA 

TCAGGCTCCT 

GTGAAGTTTG 

ACCATGACAT 

GACATGGATG 

GCCATCATCC 

CTCTGTCGGA 

GAGGCCAACG 

GATGAGCCAA 

GAGTACCAGA 

CTGGATTATG 

CCAGGATCTG 

GCTACCTCTC 

TCACTGGCTT 

GCTCTGGGCC 

GGGAAGATGC 

GCATCTTGCC 

TATCCAGGAT 

ACAGGCCAG6 



11 
I 

GGCTGGAAGA 
TCCTG6GGCA 
TAACAGGACC 
ACACCCAAGT 
TACGTGACTC 
GCCACAAGGT 
GCCACTACAC 
AGATTACTGA 
GCGGTTATGG 
GTTCTCCTCC 
AAGTAGCAAC 
ATTTCTGCAC 
TGGTCAAAGA 
ACCCCTTGAA 
GCTACCTTGG 
TCATCATCTC 
AGACATCCCA 
ACTCTGGAGA 
CTTCCCAGAA 
TCATCGCCCA 
AGTTTCTGGC 
CTGACATAAT 
TTCCTGGATA 
TGCCCTGGAA 
CTTCTAGTAT 
CCATAGCACT 
ACCAGAAGAC 
CATTTCTCTT 
GTTCAGTTCT 



21 
I 

AAGGACAGAA 

CCTAACAGTG 

TTGGAAAGGG 

CTTGGTGAAG 

TTCTGGAGAC 

TCCAGGAGAT 

GTGTGAAGTC 

GCTCCGTGTC 

CTTCACGGTG 

CATCAGTTAT 

CCTAAGTACC 

TGCCAAGGGC 

CTCCTCAAAG 

AGCAACATCT 

AGAGACCAGT 

CTTGTGCTGT 

ACAAGAGCAT 

AACCATGAGG 

TCTGGGCAAC 

GATCAATGGC 

CACTGAGGGC 

TGCCTAGTCA 

6CCCAAAGT6 

TTTGCCAGAT 

CTCTGCCGGG 

A6GACTT66T 

CCGAGGGAG6 

TCTTCAGGGC 

GCTCCTCCAC 



31 

1 

GTAGCTCTGG 

GACACTTATG 

GATGTGAATC 

TGGCTGGTAC 

CATATCCAGC 

GTATCCCTCC 

ACCTGGCAGA 

CAGAAACTCT 

CCCCAGGGAA 

ATTTGGTATA 

TTACTCTTCA 

CAGGTTGGCT 

CTACTCAAGA 

ACAGTGAAGC 

GCTGGGCCAG 

ATGGTGGTTT 

GTCTACGAAG 

GTGGCCATCT 

AACTACTCTG 

AACTACGCCC 

AAAAGTGTCT 

GTCCTTGCCT 

TCCGCCTACC 

GCATCrCAAG 

GGCTTCT6GT 

CATCATGCCT 

CTCAGCTCTG 

CAGACAGCTT 

TATAAGTCTA 



41 

I 

CTGTG ATG GG 
GCCGTCCCAT 
TTCCCTGCAC 
AACGTGGCTC 
AGGCAAAGTA 
AATTGAGCAC 
CTCCTGATGG 
CTGTCTCCAA 
TGAGGATTAG 
AGCAACAGAC 
AGCCTGCGGT 
CTGAGCAGCA 
CCAAGACTGA 
AGTCCTGGGA 
GAAAGAGCCT 
TTACCATGGC 
CAGCCAGGGC 
TCXSCAAGTGG 
ATGAGCCCTG 
GCCTGCTGGA 
G TTAAA AATG 
TCTGCATGGC 
AACACTGGAG 
TAAGCCAQCT 
ACTCCTCTCT 
ACAGACACTA 
CCAGCTCAGA 
TTAATTGAAA 
ATQTTCTGAC 



51 

1 

GATCTTACTG 

CCTGGAAGTG 

CTATGACCCC 

AGACCCTGTC 

CCAGGGCCGC 

CCTGGAGATG 

CAACCAAGTC 

GCCCACAGTG 

CCTTCAATGC 

TAATAACCAG 

GATAGCCGAC 

CAGCX3ACATT 

GGCACCTACA 

CTGGACCACT 

GCCTGTCTTT 

CTATATCATG 

ACATGCCAGA 

CTGCTCCAGT 

CATAGGACAG 

CACAGTTCCT 

CCCCATTAGG 

CTTCTTCCCT 

CCGCTGGGAG 

GCTGGATTTG 

AAATACCAGA 

TTCAACTTTG 

G6ACCAGCTA 

TTGTTATTTC 

TCTCTCCTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



284 



MSDOCIO:<WO 02079492A2J.> 



wo ()2/»79492 



PCT/IJS02/(M915 



TGCTCAATAA ATATCTAATC ATAACAGCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 225 Protein sequence; 
Protein Accession #; NP 009199.1 



1 

I 

MGIIiliGLLLL 
GSDPVTIFLR 
DGNQWRDKI 
QTNNQEPIKV 
TEAPTTMTYP 
MAYIMLCRKT 
PCIGQEYQII 



11 
I 

GHLTVDTYGR 
DSSGDHIQQA 
TELRVQKLSV 
ATLSTLLFKP 
LKATSTVKQS 
SQQEHVYEAA 
AQIN6NYARL 



21 
I 

PILEVPESVT 
KYQGRIiHVSH 
SKPTVTTGSG 
AVIADSGSYF 
WDWTTDMDGY 
RAHAREANDS 
liDTVPIiDYEF 



31 

I 

GPWKGDVNLP 
KVPGDVSLQL 
YGFTVPQGMR 
CTAKGQVGSE 
LGETSAGPGK 
GETMRVAIFA 
LATEGKSVC 



41 
I 

CTYDPLQGYT 
STLEMDDRSH 
ISLQCQARGS 
QHSDIVKFW 
SLPVFAIILI 
SGCSSDEPTS 



51 

I 

QVLVKWLVQR 60 

YTCEVTWQTP 120 

PPISYIWYKQ 180 

KDSSKLIiKTK 240 

ISLCCMWFT 300 

QNLGNNYSDE 360 



Seq ID NO: 226 DNA sequence 

Nucleic Acid Accession #: XM_64321 

Coding sequence: 1-2079 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGGTCGCCA 
CCAGGACCAC 
GAGCCTGGTA 
GGGGTTGGGA 
ATCCGTCAGG 
CCATACGAAG 
CTCACAAATG 
CATAATCAAT 
TTCCCCTCCA 
CCTGAAGTTA 
TCCCTTTGTG 
GAAAGCA6AA 
CGCAAGTCAC 
GTTCACACCA 
CAGCCAATAC 
GGTGATCTTG 
CGGAGGAGCC 
CTCAAGAGGT 
CCCACATTCA 
TCCCTGCTCC 
GGGGGGCGCC 
AGGCCAGGGA 
TCCTCTTTAG 
TGCTTAGAAG 
GGTCCAGACA 
CAGCACCAGA 
AATGAATATG 
TATTCCCCAA 
GTCTCCCTTT 
GTCGTAGCAG 
CAGAAGCAGC 
CGCAAGTTCC 
CCCCCGGCCG 
TGCTCCTGCT 
TTCATTCCAC 



11 

I 

GTTCCGATCA 
GCCTCCGCTC 
CTTACAGGGA 
TCACGGGAAA 
AGGATGCCTT 
CTACCTTGCA 
GCTACCTGCC 
ATCCTAATGG 
CAGCCACCAC 
AACTAAAAAT 
GAGACCTTTT 
AAGAAAAGAG 
ACAAAATCCC 
TATCT^AAAA 
TATCTTCTGT 
TGTGGTCCAA 
ATCACTGTTC 
CTTTCATGGT 
AGGGAACTGC 
TTGGTCATTG 
GCTGGGTGTG 
AGGAACCTCT 
CCACCTCTCA 
ACTATGCAGG 
GCAGGACTGG 
GATATCTCCT 
ATTGCAAGCT 
CCCACATCCT 
CCAA6TTCCT 
TACGTAGGAG 
CCTGTCCTGC 
ACX3TGATGGC 
CGCCCCGCAA 
CCCAGGATGT 
ATCCCTGGGT 



21 

I 

AGACAGAGCC 
TGCCCAGAGG 
GGGTGGTGGA 
CACAGTTCAA 
TGATAACAAA 
GCAAAGCTTT 
ATCAATCAGC 
AAATTCTAAA 
TTCGGTACCA 
AACCl^AAACT 
AAATGAAGTA 
GAAAAAACCC 
CAAATTAGAA 
ACCAAGGG7U\ 
TCCAACAACA 
GGTGACGGTC 
CAGCTGCCTG 
TTCTTCCTTG 
CCAGATGGGC 
GGAAGGAACA 
GCAGCATCAG 
GAGACTCAGT 
GGGCTGCTGG 
GCGCCGCCAT 
AAGCCTTAGA 
GCAAGGCCTC 
AGAGACGA6A 
TCAGTCTJSAA 
CAAACGCAAA 
ATCCAATATG 
CAAGTACACG 
TCA6AAGAGG 
G6TGGCTGAC 
GTATCTGACT 
GCCCTTCGGC 



31 

I 

CCGTATCTTC 
CCAAAAGCAG 
GCCATCGTCC 
CAACCACCTC 
ATTGACATTG 
CAATACTCAC 
ATGTATGAAA 
CAGJ\AGACCA 
CAAACTGTGA 
ATCCAGAATG 
CAGGCAAGTG 
AAAAAGCATG 
CCAGAGGAAC 
GATCCAGTAC 
GAA6TGTCCA 
ACACCCTGTT 
GAGATCTTGG 
AAGTTCCTCA 
TGGTCACCTA 
GACCAGATGT 
AAGCCTCAGA 
TTCCTACGAT 
TGTTCGCCCC 
TTGACACTCA 
GCTGTCGGCA 
CTAGGTGGGT 
GAAGCGGCGT 
AGTGCCCCTA 
GCAAACAGCC 
CCTGGCACAA 
CCTGCCTGCC 
GGCCTGTCAG 
AGACGCCA6C 
GGAGTTTCTG 
TCCTC CTAG 



41 
I 

CAGGGACACT 
CCCAACAAGA 
TCACGTATGC 
AACTCACTGA 
CTGAAGATGG 
CTACAACAGA 
TTCAAACCAA 
CATTAAATTC 
TTCCAAAGAA 
GCAGGGAATT 
AGCACACGAA 
ACTCATCAAG 
AAAATAGACC 
TAAAAGAGGA 
CTGGT6TTAA 
GG6TGCCCCG 
TGCTGGTGCC 
CCTCCACGGG 
TGGCCTCCAC 
CATCCAGGGG 
TCCGCATCTC 
GTGAAGTGGA 
CAGACCACGT 
GAGCCCAGGA 
AGAGATACTG 
TCTTGGAAGA 
CCTCAACTCC 
ACCACTACTT 
ATTTCCTGCA 
GGGGGTGGG6 
ACGCACAAT6 
GAAGATGTAG 
AGCTGCCGGG 
GATTAAAGGC 



51 

I 

AGACAAGATG 
GCCCGGCATT 
GCTGGGGATC 
CTCCGCCAGC 
TGGCCAGACA 
TCTTCCTCCA 
ATACCAGTCG 
TAGAAAACCC 
GAGT6GCTCA 
GTTCAA6TCT 
GTCAAAGCAT 
ATCTGAAGAG 
AAATGAGAGG 
AGCCCCAGTT 
GTTTCAAGTT 
CCTGCGAGGA 
AGCCCTCAGC 
CAAACAGAAG 
GACCAATGTC 
CCCGGAATTT 
CATCTGCCAC 
GAGAAGAATC 
CTGTGAGAAA 
AGCCTTTCTT 
CAGGAACAGC 
AA66AAT6CC 
AAGAATCCCG 
TCCCTACCAC 
CCTGTGTGCA 
T6GCCACAAA 
GGAGACATTC 
GGGCCAGCAG 
GGCTCCGGGC 
CAGTCGTGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO: 227 Protein sequence: 
Protein Accession #: XP_064321.1 

1 11 21 31 41 51 

I I.I I I I 

MVASSDQDRA PYLPGTLDKM PGPRLRSAQR PKAAQQEPGI EPGTYREGGG AIVLTYAI/3I 60 

GVGITGNTVQ QPPQLTDSAS IRQEDAFDNK IDIAEDGGQT PYBATLQQSF QYSPTTDLPP 120 

IjTNGYIiPSIS MYEIQTKYQS HNQYPNGNSK QICTTUISRKP FPSTATTSVP QTVIPKKSGS 180 

PEVKIiKITKT IQNGRELFKS SLCQDUjNHV QASEHTKSKH ESRXEKRKKP RKHDSSRSEB 240 

RKSHKIPKLE PBEQNRPNER VHTISEKPRB DPVLKEEAPV QPILSSVPTT EVSTGVKPQV 300 

GDXiVWSKVTV TPCWVPRIiRG RRSHHCSSCL EIIiVIiVPAIiS LKRSFMVSSIj KFLTSTGKQK 360 

PTFKGTAQMG WSPMASTTNV SUiLGHWEGT DQMSSRGPEF GGRRWVWQHQ fCFQIRISICH 420 

RPGKEPUUjS FIiRCEVERRI SSIiATSQGCW CSPPDHVCEK CLEDYAGRRH LTLRAQEAFL 480 

GPDSRTGSIjR AVGKRYCRNS QHQRYLIiQGIj LGGFLEERNA NEYDCKLETR EAASSTPRIP 540 

YSPTHILQSE SAPNHYFPYH VSLSKFUCRK ANSHFLHIiCA WAVRRRSNM PGTRGWGGHK 600 

QKQPCPAKYT PACHAQWETF RKFHVMAQKR GLSGRCRGQQ PPAAPRKVAD RRQQIjPGAPG 660 
CSCSQDVYIjT GVSGLKASRG FIPHPWVPFG SS 
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Seq ID NO: 228 DNA aecruence 

Nucleic Acid Accession #: NM_006033 

Coding sequence: 253-1752 (underlined sequences correspond to start and stop codons) 
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AGCAGCGAGT 
CCGTTGACAC 
CCTTCTCTGC 
GCTGGAAACA 
TGGCGGGGCA 
TTTGCTGCGG 
AAACCCAAAG 
AAGGACCCAG 
TGCAGTTTCA 
ATCTTTGAAA 
AATGTAGTTG 
AATACCAGGG 
GATTTTTCTC 
TATGCAGGCA 
CCCATGTTTG 
GATGTCCTCC 
GGCCACATTG 
GTCTTGGGAT 
GCCGTCCACC 
TGCACTGACT 
AATAGCATT6 
AAAACCCGGG 
AGTTACAAGA 
GCAGATTCCC 
ACCTTCCTGG 
GAGGGGGCCT 
CCCCGCAACC 
CAGCGGAAAC 
GAGCTCTGGT 
GTGGAGCTTC 
TATCCAAGCC 
AGCCTTGACC 
GGAGGGGACT 
TCTGTCCACA 
GCCTCTCGTG 
GAGGAACGCT 
GAGCCTCAGT 
ATCTTGCTCG 
TCTTAGCCAT 
TATTTTTCTG 
ATGTGGTAAT 
TAGACCCTCT 
CTGGGGGAGG 
CTTCTTAGTT 
GTGTTTGATC 
GCCGGTTGCC 
ATGACATATG 
TCTAAGATTT 
CTTTTCTTGT 
ACACAATATC 
GTA6GGGGAG 
TCAGCCACAT 
TTCCTGGAGG 
AATATGCCCC 
ATCCTAAGCC 
TTATGGCTGA 
TGTACTTAGG 
TTAAACTTCT 
GTGTGCTTCT 
TATCCCTTAT 
TTTGGAATGT 
AAAATTCATT 
TATTTTAAAT 
CATTTTTTGT 
TTTTTGGATT 
TCTGTTTTTA 



11 

I 

CCTTGCCTCC 
TCGCTCCCTG 
CTCCAGTCCC 
CCAAGAGGTG 
GGATGAGCAA 
GGAGCCCCGT 
CTACACAGAC 
AGCATGAAGG 
ACATGACAGC 
ACTGGCTGCA 
TGGTTGACTG 
TGGTGGGACA 
TCGGGAATGT 
ACTTCGTGAA 
AAGGGGCCGA 
ACACCTACAC 
ACATCTACCC 
CAATTGCATA 
TCTTTGTTGA 
CCAATCGCTT 
GCTACAATGC 
CAGGCATGCC 
ACATGGGAGA 
AGACTCTGCC 
TCTACACCGA 
CTCAGTCTTG 
CCGGACGGGA 
TGACATTTTG 
TTCGCAAGTG 
CCEGAGGGTG 
CATGGAGGAA 
CTGGAGCACT 
GCGCTGCTAT 
CCTCCAGAGC 
CACACTGGAT 
66CTCCGAAG 
GAGAAGTCCT 
GGCCCTAGCT 
TCC6TCCTGC 
TTCATTTTTT 
GGACATATTA 
TTCTGTTTGG 
CTATAGGATA 
ATTATGTGCC 
ACTAGCAGAA 
AGATATAACT 
TAATACACAT 
AATACAGTGC 
TGAACTTCTT 
TGAGACACTT 
AAGGGCAACT 
GCCTAGACTT 
CAAAGTCTAT 
AGACGTGAGA 
CTTTTATTAA 
GATTCGGGAG 
TTTTCTAAGG 
GAAGACATAA 
CTTGTTTCTG 
CCAAAATGCT 
TTGCATATAC 
6ATGTGTCAG 
AGTTGTGTAC 
CTTGTTGGTG 
AGGGTTGCTC 
CTAAT6GAAG 



21 
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CGGCX3GCTCA 
CCACCGCCCG 
CCAGCCCCTG 
GTrTTTGTTT 
CTCCGTTCCT 
ACCTTTTGGT 
TGAGGTCAAA 
ATGCTACCTC 
TAAAACCTTT 
CAAACTCGTG 
GCTCCCCCTG 
CAGCATTGCC 
CCACTTGATC 
AGGAACGGTG 
CATCCACAAG 
GCGTTCCTTC 
CAATGGGGGT 
TGGAACAATC 
CTCTCTGGTG 
CAAAAAGGG6 
CAAGAAAATG 
TTTCAGAGTT 
AATTGAGCCC 
ACTGGAAATA 
GGAGGACTTG 
GTACAACCTG 
GCTGAATATC 
TACAGAAGAC 
TCGGGATGGC 
CCCGGGCAAG 
AGTTACTGCT 
GGGAACAACT 
AGCTCTTGCT 
ACCAAGTCCA 
TGGTTTCTCA 
AGGCCCTGTG 
TCCGACAGGA 
GTTGGGGTTC 
TCCCCAGCTC 
AATTGAGCAA 
CTGAGCCTCT 
ATGGTGTAT6 
TAAGCATTAG 
ACCTTCTTAG 
TAGCAAGCAG 
GCTTTGGAGC 
CTGTGTACAC 
TTTTTTTCCT 
GGAAAAGCCA 
ACACTTTTCA 
ATTATTATCC 
ATATACTAGT 
CTCTGAAACT 
CAAACAAGGA 
TGTATAACCA 
GAAGTGTGAC 
ACATTGTTTT 
CCAGTTGAGT 
TGATTGCTTT 
TGGAACCAGA 
ATAATGA6AT 
TTACACCTTA 
ATGAAGCATG 
CTCAAAAAGT 
AACCCATATT 
CTTTGCA 
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GGACGAGGGC 
GGCTCCGTGC 
GCCGAGAGAA 
TTTAAAACTT 
CTGCTCTGTT 
CCAGAGGGAC 
CCATCTGTGA 
TCCGTCGGCC 
TTCATCATTC 
TCAGCCCTGC 
GCCCACCAGC 
AGGATGCTCG 
GGCTACAGCC 
GGCCGAATCA 
AGGCTCTCTC 
GGCTTGAGCA 
GACTTCCAGC 
ACAGAGGTGG 
AATCAGGACA 
ATCTGTCTGA 
AGGAACAAGA 
TACCATTATC 
ACCTTTTACG 
GTGGAGCGGA 
GGAGACCTCT 
TGGAAGGAGT 
AGGCGCATCC 
CCTGAGAACA 
TGGAGGATGA 
TCTTGCCAGC 
GAGGACCCAC 
GGTCTCCTGT 
GCCTCTCTTG 
GATTTGTGTG 
GTTGCTGGGC 
TAGAAGGCTG 
GCTGACTCAT 
TCATGGGTTG 
ACTCTCTGAA 
ATGTCTATTG 
CCATTTGGAA 
TGTATATGCA 
6GACCCTGAG 
TTATTATGTG 
AGTATCATTC 
AAATCTCTTC 
AGAAACCGGC 
CTTTGAAATA 
CCAATTCTAG 
AAAGATTTGT 
CTATTTTACA 
TAGTGGTGCA 
CCATGAAGAC 
CTTTTTTTTT 
GGAGAACATC 
ACCAAGCAGG 
AATCTGTATC 
CTTATTTCAA 
CTAGCCAAAG 
AGT6TTTCAA 
ATTTTGGGAA 
TCCACATAGC 
GTTTGTGGTA 
TTTGGATTTT 
ATTG6CTGTA 



41 



51 



AGATCTCGTT 
CGCCAAGTTT 
GGGTCTTACC 
CTGTTTCTTG 
TCTGGAGCCT 
GGCTGGAAGA 
GGTTTAACCT 
ACAGCCAGCC 
ACGGATGGAC 
ACACAAGAGA 
TTTACACGGA 
ACTGGCTGCA 
TCGGAGCGCA 
CAGGTTTGGA 
CGGACGATGC 
TTGGTATTCA 
CAGGCTGTGG 
TAAAATGTGA 
AGCCGAGTTT 
GCTGCCGCAA 
GGAACAGCAA 
AGATGAAAAT 
TCACCCTTTA 
TCGAGCAGAA 
TGAAGATCCA 
TTCGCAGCTA 
GGGTGAAGTC 
CCAGCATATC 
AAAACGAAAC 
AAGGCAGCAA 
CCAATGGAAG 
GATGGCTGGG 
AATAGCTCTA 
TAAGCAGCTG 
GAGCCTGTAC 
TCAGCTGCTC 
GTCAGGATGG 
CACTGACCAT 
GCACACATCA 
AACACTTAAA 
CCCAGTG6AG 
TGGG6AAAGG 
6CTTTAAGTG 
CCACCTCCCC 
ATGCTG6GGC 
TGTTTAGAGA 
ACCTGCCAGA 
TTTTACTTTA 
ATCTTGATTT 
GTATGCATTG 
AAACTGAGGC 
GCCAGGGAGA 
TTTTGCAGCC 
TATATAGAGC 
TGTGCCAACG 
AGAGGAAGAA 
GTGCCAAAGT 
GATATGTTCT 
CGAA6CTTGT 
ATTTTAGATT 
TAGGACCCGA 
CTGAGGGTAA 
ACTTATGTGA 
GGAGCATTTC 
CATCCTGGTC 



CTGGGGCAAG 
TCATTTTCCA 
GGCCGGGATT 
GGAGGGGGTG 
CTGCTATTGC 
TAAGCTCCAC 
CCGCACCTCC 
CTTAGAAGAC 
GATGAGCGGT 
GAAAGACGCC 
TGCGGTCAAT 
GGAGAAGGAC 
CGTGGCCGGG 
TCCTGCCGGG 
AGATTTTGTG 
GATGCCTGTG 
ACTCAACGAT 
GCATGAGCGA 
TGCCTTCCAG 
GAACCGTTGT 
AATGTACCTA 
CCATGTCTTC 
TGGCACTAAT 
TGCCACCAAC 
GCTCACCTGG 
CCTGTCTCAA 
TGGGGAAACC 
CCCAGGCCGG 
CAGTCCCACT 
GACTTCCTGC 
GATTCTTCTC 
ACTCCTCGCG 
ACTCCAAACC 
GGTGCCTGGG 
TCTGCCTGAC 
AGCCTGCTTT 
CAGGCCTGGT 
ACTGCTTACG 
TTGGCTTTCC 
ATTAATTAGA 
TTGGGATTTC 
CACCTGGGGC 
GTTTCTATTT 
TATGAGTGAC 
CAGAAT6ATG 
GATAGAAGTT 
CAGAGCTGGT 
ATACCAGTGC 
GAATTAATAC 
CCTAATTAGA 
TTAGTGAGGT 
GGACTCAGAT 
AGTTCCCACC 
CATCCATAAA 
GTTGGACTTT 
TGATTTTCTT 
TGTATCACTG 
CAAGCCAATT 
ACAGGTTGAG 
ATTTTCAGAT 
GCCTAAACAC 
TTTTATACGA 
GGGGTTTTCC 
G6ATTTTGGA 
ACTTCTGACT 
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Seq ID NO; 229 Protein aemienee = 
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Protein Accession #: NP_00€024.l 

1 11 21 31 41 51 

I I I I I I 

MSNSVPIjLCF WSLCYCFAAG SPVPPGPEGR LEDKLHKPKA TQTEVKPSVR FMLRTSKDPE 60 

HEGCYIjSVGH SQPliBDCSFN MTAKTPFIIH GWTMSGIFEN WLHKXiVSALH TREKDANVW 120 

VDWLPIiAHQIi YTDAVNNTRV VGHSIARMLD WLQEKDDFSI* GNVHLIGYSli GAHVAGYACai 180 

FVKGTVGRIT GLDPAGPMFE GADIHKRLSP DDADPVDVIiH TYTRSFGIiSI 6XQMPVGHID 240 

lYPMGGDFQP GCGLNDVLGS lAYGTITEW KCEHERAVHIi FVDSLVNQDK PSFAFQCTDS 300 

NRFKKGICLS CRKNRCNSIG YNAKKMRNKR NSKMYLKTRA GMPFRVYHYQ MKIHVFSYKN 360 

MGEIEPTFYV TLYGTNADSQ TLPLEIVERI BQNATNTFLV YTEEDLGDLIi KIQLTWEGAS 420 

QSWYNIiWKEF RSYLSQPRNP GRELHIRRIR VKSGBTQRKIi TFCTEDPBNT SISPGRELWF 480 
RKCRDGWRMK NETSPTVBLP 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
5 application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMEP IS; 



1 1 . A method of detecting an angiogenesis-associated transcript in a cell in 

2 a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridized to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1 -8. 

1 2. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 3. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 4. The method of claim 3, wherein the nucleic acids are mRNA. 

1 S. The method of claim 3, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 6. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-8 . 

1 7. The method of claim 1 , wherein the polynucleotide is labeled. 

1 8. The method of claim 7, wherein the label is a fluorescent label. 

1 9. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 10. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat a disease associated with angiongenesis. 

1 11. The method of claim 1 » wherein the patient is suspected of having 

2 cancer. 

1 12. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1*8. 

1 13. The nucleic acid molecule of claim 12, which is labeled. 

1 14. The nucleic acid of claim 13» wherein the label is a fluorescent label 
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1 1 5. An expression vector comprising the nucleic acid of claim 12. 

1 1 6. A host cell comprising the expression vector of claim 1 5. 

1 1 7. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-8 

1 1 8. An antibody that specifically binds a polypeptide of claim 1 7. 

•1 19. The antibody of claim 18, fiirther conjugated or fused to an effector 

2 component. 

1 20. The antibody of claim 19, wherein the effector component is a 

2 fluorescent label. 

1 21. The antibody of claim 1 9, wherein the effector component is a 

2 radioisotope. 

1 22. The antibody of claim 19, which is an antibody firagment. 

1 . antibody of claim 19, which is a himianized antibody 

1 24. A method of detecting a cell undergoing angiogenesis in a biological 

2 sample fi^om a patient, the method comprising contacting the biological sample with an 

3 antibody of claim 1 8. 

1 25. The method of claim 24, wherein the antibody is fiirther conjugated or 

2 fiised to an effector component. 

1 26. The method of claim 25, wherein the effector component is a 

2 fluorescent label. 

1 27. The method of detecting antibodies specific to angiogenesis in a 

2 patient, the method comprising contacting a biological sample firom the patient with a 

3 polypeptide which is encoded by a nucleotide sequence of Tables 1-8. 
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